" EL^27flM7b5L)s) 

ATAXIA ASSOCIATED GENE AND PROTEIN 

This application claims priority to provisional patent applications serial number 
60/422,971, filed 1 1/01/02 and 60/424,973, filed 1 1/08/02, each of which is herein 
5 incorporated by reference in its entirety. 

This invention was made in part with government support under Grant No. 
NS32130 awarded by the National Institutes of Health. The Government may have 
certain rights in the invention. 

1 0 FIELD OF THE INVENTION 

The present invention relates to ataxia, in particular to protein and nucleic acids 
encoding proteins associated with, ataxia. The present invention provides assays for the 
detection of ataxia polymorphisms and mutations associated with disease or disease 
carrier states. 

15 

BACKGROUND OF THE INVENTION 

Hereditary ataxia is a group of rare genetic neuromuscular disorders. It is 
characterized by degenerative changes in the brain and spinal cord. It can affect a person 
anytime between infancy through adulthood. Major symptoms include lack of 

20 coordination of the muscles used for voluntary movement. Many different hereditary 
ataxia's have been characterized, including Friedreich's Ataxia, Marie's Ataxia, Ataxia 
Telangiectasia, Vasomotor Ataxia, Vestibulocerebellar, Ataxiadynamia, Ataxiophemia, 
Olivopontocerebellar Atrophy, and Charcot-Marie-Tooth Disease. 

Autosomal recessive Cayman cerebellar ataxia, or Cayman ataxia, was identified 

25 in a population isolate on Grand Cayman Island (Johnson et al., Neurology 28:352 
[1978]). This disorder is characterized by marked psychomotor retardation and 
prominent nonprogressive cerebellar dysfunction including nystagmus, intention tremor, 
dysarthria, and wide-based ataxic gait. Hypotonia is present from early childhood. 
Retinal abnormalities are absent. 

30 There is a need for identification of the molecular basis of ataxia, as well as for 

improved diagnostics and treatments for ataxia. 
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SUMMARY OF THE INVENTION 

The present invention relates to ataxia, in particular to protein and nucleic acids 
encoding proteins associated with ataxia. The present invention provides assays for the 
detection of ataxia polymorphisms and mutations associated with disease or disease 
carrier states. 

Accordingly, in some embodiments, the present invention provides a method for 
detection of a variant Cayman ataxia polypeptide or nucleic acid sequence in a subject, 
comprising providing a biological sample from a subject, wherein the biological sample 
comprises a Cayman ataxia polypeptide or nucleic acid; and detecting the presence or 
absence of a variant Cayman ataxia polypeptide or nucleic acid in the biological sample. 
In some embodiments, the variant Cayman ataxia polypeptide is a variant of SEQ ID 
NO:4 (e.g., SEQ ID NO:9). In some embodiments, the variant Cayman ataxia nucleic 
acid is a variant of SEQ ID NO:3 or SEQ ID NO: 1 1 (e.g., SEQ ID NOs: 8 and 10). In 
certain embodiments, the presence of the variant Cayman ataxia polypeptide or nucleic 
acid is indicative of Caymans ataxia in the subject. In other embodiments, the presence 
of the variant Cayman ataxia polypeptide or nucleic acid is indicative of the subject being 
a carrier for Cayman ataxia. In still further embodiments, the presence of the variant 
Cayman ataxia polypeptide or nucleic acid is indicative of a disorder selected from the 
group consisting of ataxia, myoclonus, dystonia, epilepsy, and nystagmus in the subject. 
In some embodiments, the biological sample is selected from the group consisting of a 
blood sample, a tissue sample, a urine sample, a saliva sample, and an amniotic fluid 
sample. In some embodiments, the subject is selected from the group consisting of an 
embryo, a fetus, a newborn animal, a young, animal, and an adult animal. In certain 
embodiments, the animal is a human (e.g., an adult female of child- bearing age). In 
some embodiments, the detecting comprises differential antibody binding. In other f 
embodiments, the detection comprises a Western blot. In still further embodiments, the 
detection comprises a nucleic acid detection method selected from the group consisting of 



nucleic acid sequencing, polymerase chain reaction, hybridization, denaturing high 
pressure liquid chromatography, mass spectrometry, and enzymatic detection. 

The present invention further provides a kit comprising a reagent for detecting the 
presence or absence of a variant Cayman ataxia nucleic acid or polypeptide in a 
5 biological sample. In some embodiments, the kit further comprises instruction for using 
the kit for the detecting the presence or absence of a variant Cayman ataxia nucleic acid 
or polypeptide in a biological sample. In some embodiments, the instructions comprise 
instructions required by the U.S. Food and Drug Agency for in vitro diagnostic kits. In 
some embodiments, the kit further comprises instructions for diagnosing Caymans ataxia 

10 or Cayman ataxia carrier status in the subject based on the presence or absence of the 
variant Cayman ataxia polypeptide. In other embodiments, the kit further comprises 
instructions for diagnosing a disorder selected from the group consisting of ataxia, 
myoclonus, dystonia, epilepsy, and nystagmus in the subject based on the presence or 
absence of the variant Cayman ataxia polypeptide. In some embodiments, the reagent is 

1 5 one or more antibodies. In other embodiments, the reagents comprise reagents for 

performing a nucleic acid detection assay selected from the group consisting of nucleic 
acid sequencing, polymerase chain reaction, hybridization, denaturing high pressure 
liquid chromatography, mass spectrometry, and enzymatic detection. In some 
embodiments, the variant Cayman ataxia polypeptide is a variant of SEQJD NO:4 (e.g., 

20 SEQ ID NO:9). In some embodiments, the variant Cayman ataxia nucleic acid is a 
variant of SEQ ID NO: 3 or SEQ ID NO: 1 1 (e.g., SEQ ID NOs: 8 and 10). In some 
embodiments, the biological sample is selected from the group consisting of a blood 
sample, a tissue sample, a saliva sample, a urine sample, and an amniotic fluid sample. 

25 DESCRIPTION OF THE FIGURES 

Figure 1 shows the nucleic acid sequence of the mouse Jittery gene (SEQ ID 

NO:l). 

Figure 2 shows the amino acid sequence of the mouse Jittery protein (SEQ ED 

NO:2). 

30 Figure 3 shows the nucleic acid sequence of the human Cayman ataxia mRNA 

sequence (SEQ ID NO:3). 
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Figure 4 shows the amino acid sequence of the human Cayman ataxia protein 
(SEQIDNO:4). 

Figure 5 shows a comparative map of human chromosome 19pl3.3 and mouse 
chromosome 10 used in the development of some embodiments of the present invention. 
5 Figure 6 shows an alignment of human KIA1872 (SEQ ID NO:5), macaque (SEQ 

ID NO:6), mouse jittery (SEQ ID NO:7), human NIP2 (SEQ ID. NO: 8), and mouse NIP2 
(SEQ ED NO:9) polypeptides. 

Figure 7 shows a hydrophobicity plot of mouse jittery. 
Figure 8 shows the nucleic acid sequence of SEQ ID NO:8 (Cayman Ataxia 
10 mutant). 

Figure 9 shows the amino acid sequence of SEQ ID NO: 9 (Cayman Ataxia 
mutant of SEQ ID NO:8). 

Figure 10 shows the nucleic acid sequence of SEQ ID NO:10 (Cayman Ataxia 
splice site mutant). 

15 Figure 1 1 shows the nucleic acid sequence of the human Cayman ataxia genomic 

DNA (SEQ ID NO: 11). 

Figure 12 shows a comparative map of human chromosome 19pl3.3 and mouse 
chromosome 10 used in the development of some embodiments of the present invention. 
Figure 13 shows a sequence alignment of predicted ATCAY protein sequences 
20 with related genes across species. 

Figure 14 shows recombinants in the region of atcay. 

Figure 15 shows Table 2, markers and primers used in mapping of atcay, 

DEFINITIONS 

25 To facilitate understanding of the invention, a number of terms are defined below. 

As used herein, the term "Cayman ataxia" or " Cayman ataxia" when used in 
reference to a protein or nucleic acid refers to a protein or nucleic acid encoding a protein 
that, in some mutant forms, is correlated with ataxia. The term Cayman ataxia 
encompasses both proteins that are identical to wild-type Cayman ataxia and those that 

30 are derived from wild type Cayman ataxia (e.g., variants of Cayman ataxia or chimeric 
genes constructed with portions of Cayman ataxia coding regions). In some 
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embodiments, the "Cayman ataxia" is the wild type nucleic acid (SEQ ED NO: 3) or 
amino acid (SEQ ID NO:4) sequence. In other embodiments, the "Cayman ataxia" is a 
variant or mutant (e.g., including, but not limited to, variants resulting in disease). 

As used herein, the term "instructions for using said kit for said detecting the 
5 presence or absence of a variant Cayman ataxia polypeptide in a said biological sample" 
includes instructions for using the reagents contained in the kit for the detection of variant 
and wild type Cayman ataxia polypeptides. In some embodiments, the instructions 
further comprise the statement of intended use required by the U.S. Food and Drug 
Administration (FDA) in labeling in vitro diagnostic products. The FDA classifies in 

10 vitro diagnostics as medical devices and requires that they be approved through the 

510(k) procedure. Information required in an application under 51 0(k) includes: 1) The 
in vitro diagnostic product name, including the trade or proprietary name, the common or 
usual name, and the classification name of the device; 2) The intended use of the product; 
3) The establishment registration number, if applicable, of the owner or operator 

15 submitting the 510(k) submission; the class in which the in vitro diagnostic product was 
placed under section 513 of the FD&C Act, if known, its appropriate panel, or, if the 
owner or operator determines that the device has not been classified under such section, a 
statement of that determination and the basis for the determination that the in vitro 
diagnostic product is not so classified; 4)Proposed labels, labeling and advertisements 

20 sufficient to describe the in vitro diagnostic product, its intended use, and directions for 
use. Where applicable, photographs or engineering drawings should be supplied; 5) A 
statement indicating that the device is similar to and/or different from other in vitro 
diagnostic products of comparable type in commercial distribution in the U.S., 
accompanied by data to support the statement; 6) A 510(k) summary of the safety and 

25 effectiveness data upon which the substantial equivalence determination is based; or a 
statement that the 510(k) safety and effectiveness information supporting the FDA 
finding of substantial equivalence will be made available to any person within 30 days of 
a written request; 7) A statement that the submitter believes, to the best of their 
knowledge, that all data and information submitted in the premarket notification are 

30 truthful and accurate and that no material fact has been omitted; 8) Any additional 

information regarding the in vitro diagnostic product requested that is necessary for the 
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FDA to make a substantial equivalency determination. Additional information is 
available at the Internet web page of the U.S. FDA. 

The term "gene" refers to a nucleic acid (e.g., DNA) sequence that comprises 
coding sequences necessary for the production of a polypeptide, RNA (e.g., including but 
5 not limited to, mRNA, tRNA and rRNA) or precursor (e.g., Cayman ataxia). The 

polypeptide, RNA, or precursor can be encoded by a full length coding sequence or by 
any portion of the coding sequence so long as the desired activity or functional properties 
(e.g., enzymatic activity, ligand binding, signal transduction, etc.) of the full-length or 
fragment are retained. The term also encompasses the coding region of a structural gene 

10 and the including sequences located adjacent to the coding region on both the 5* and 3' 
ends for a distance of about 1 kb on either end such that the gene corresponds to the 
length of the full-length mRNA. The sequences that are located 5 1 of the coding region 
and which are present on the mRNA are referred to as 5* untranslated sequences. The 
sequences that are located 3 ? or downstream of the coding region and that are present on 

15 the mRNA are referred to as 3' untranslated sequences. The term "gene" encompasses 
both cDNA and genomic forms of a gene. A genomic form or clone of a gene contains 
the coding region interrupted with non-coding sequences termed "introns" or "intervening 
regions" or "intervening sequences." Introns are segments of a gene that are transcribed 
into nuclear RNA (hnRNA); introns may contain regulatory elements such as enhancers. 

20 Introns are removed or "spliced out" from the nuclear or primary transcript; introns 
therefore are absent in the messenger RNA (mRNA) transcript. The mRNA functions 
during translation to specify the sequence or order of amino acids in a nascent 
polypeptide. 

In particular, the term "Cayman ataxia gene" refers to the full-length Cayman 
25 ataxia genomic sequence (e.g., contained in SEQ ID NO: 1 1) or mRNA sequence (e.g., 
contained in SEQ ID NO:3). However, it is also intended that the term encompass 
fragments of the Cayman ataxia sequence, mutants as well as other domains within the 
full-length Cayman ataxia nucleotide sequence. Furthermore, the terms "Cayman ataxia 
nucleotide sequence" or "Cayman ataxia polynucleotide sequence" encompasses DNA, 
30 cDNA, and RNA (e.g., mRNA) sequences. 
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Where "amino acid sequence" is recited herein to refer to an amino acid sequence 
of a naturally occurring protein molecule, "amino acid sequence" and like terms, such as 
"polypeptide" or "protein" are not meant to limit the amino acid sequence to the 
complete, native amino acid sequence associated with the recited protein molecule. 
5 In addition to containing introns, genomic forms of a gene may also include 

sequences located on both the 5' and 3' end of the sequences that are present on the RNA 
transcript. These sequences are referred to as "flanking" sequences or regions (these 
flanking sequences are located 5' or 3 1 to the non-translated sequences present on the 
mRNA transcript). The 5 f flanking region may contain regulatory sequences such as 

10 promoters and enhancers that control or influence the transcription of the gene. The 3' 
flanking region may contain sequences that direct the termination of transcription, 
post-transcriptional cleavage and polyadenylation. 

The term "wild-type" refers to a gene or gene product that has the characteristics 
of that gene or gene product when isolated from a naturally occurring source. A wild- 

15 type gene is that which is most frequently observed in a population and is thus arbitrarily 
designed the "normal" or "wild-type" form of the gene. In contrast, the terms "modified," 
"mutant," "polymorphism," and "variant" refer to a gene or gene product that displays 
modifications in sequence and/or functional properties (i.e., altered characteristics) when 
compared to the wild-type gene or gene product. It is noted that naturally-occurring 

20 mutants can be isolated; these are identified by the fact that they have altered 
characteristics when compared to the wild-type gene or gene product. 

As used herein, the terms "nucleic acid molecule encoding," "DNA sequence 
encoding," and "DNA encoding" refer to the order or sequence of deoxyribonucleotides 
along a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides 

25 determines the order of amino acids along the polypeptide (protein) chain. The DNA 
sequence thus codes for the amino acid sequence. 

DNA molecules are said to have "5 1 ends" and "3 ! ends" because mononucleotides 
are reacted to make oligonucleotides or polynucleotides in a manner such that the 5' 
phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its neighbor 

30 in one direction via a phosphodiester linkage. Therefore, an end of an oligonucleotides or 
polynucleotide, referred to as the "5' end" if its 5* phosphate is not linked to the 3' oxygen 
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of a mononucleotide pentose ring and as the M 3' end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oligonucleotide or polynucleotide, also may be said 
to have 5' and 3 1 ends. In either a linear or circular DNA molecule, discrete elements are 
5 referred to as being "upstream" or 5' of the "downstream" or 3' elements. This 

terminology reflects the fact that transcription proceeds in a 5' to 3' fashion along the 
DNA strand. The promoter and enhancer elements that direct transcription of a linked 
gene are generally located 5' or upstream of the coding region. However, enhancer 
elements can exert their effect even when located 3' of the promoter element and the 

10 coding region. Transcription termination and polyadenylation signals are located 3 1 or 
downstream of the coding region. 

As used herein, the terms "an oligonucleotide having a nucleotide sequence 
encoding a gene" and "polynucleotide having a nucleotide sequence encoding a gene," 
means a nucleic acid sequence comprising the coding region of a gene or, in other words, 

15 the nucleic acid sequence that encodes a gene product. The coding region may be present 
in a cDNA, genomic DNA, or RNA form. When present in a DNA form, the 
oligonucleotide or polynucleotide may be single-stranded {i.e., the sense strand) or 
double-stranded. Suitable control elements such as enhancers/promoters, splice 
junctions, polyadenylation signals, etc. may be placed in close proximity to the coding 

20 region of the gene if needed to permit proper initiation of transcription and/or correct 

processing of the primary RNA transcript. Alternatively, the coding region utilized in the 
expression vectors of the present invention may contain endogenous 
enhancers/promoters, splice junctions, intervening sequences, polyadenylation signals, 
etc. or a combination of both endogenous and exogenous control elements. 

25 As used herein, the term "regulatory element" refers to a genetic element that 

controls some aspect of the expression of nucleic acid sequences. For example, a 
promoter is a regulatory element that facilitates the initiation of transcription of an 
operably linked coding region. Other regulatory elements include splicing signals, 
polyadenylation signals, termination signals, etc. 

30 As used herein, the terms "complementary" or "complementarity" are used in 

reference to polynucleotides {i.e., a sequence of nucleotides) related by the base-pairing 
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rules. For example, for the sequence 5 , - ,t A-G-T-3', M is complementary to the sequence 3- 
"T-C-A-5\" Complementarity may be "partial," in which only some of the nucleic acids 1 
bases are matched according to the base pairing rules. Or, there may be "complete" or 
"total" complementarity between the nucleic acids. The degree of complementarity 
5 between nucleic acid strands has significant effects on the efficiency and strength of 
hybridization between nucleic acid strands. This is of particular importance in 
amplification reactions, as well as detection methods that depend upon binding between 
nucleic acids. 

The term "homology" refers to a degree of complementarity. There may be 

10 partial homology or complete homology (i.e., identity). A partially complementary 

sequence is one that at least partially inhibits a completely complementary sequence from 
hybridizing to a target nucleic acid and is referred to using the functional term 
"substantially homologous." The term "inhibition of binding," when used in reference to 
nucleic acid binding, refers to inhibition of binding caused by competition of homologous 

1 5 sequences for binding to a target sequence. The inhibition of hybridization of the 

completely complementary sequence to the target sequence may be examined using a 
hybridization assay (Southern or Northern blot, solution hybridization and the like) under 
conditions of low stringency. A substantially homologous sequence or probe will 
compete for and inhibit the binding (i.e., the hybridization) of a completely homologous 

20 to a target under conditions of low stringency. This is not to say that conditions of low 
stringency are such that non-specific binding is permitted; low stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., selective) 
interaction. The absence of non-specific binding may be tested by the use of a second 
target that lacks even a partial degree of complementarity (e.g., less than about 30% 

25 identity); in the absence of non-specific binding the probe will not hybridize to the 
second non-complementary target. 

The art knows well that numerous equivalent conditions may be employed to 
comprise low stringency conditions; factors such as the length and nature (DNA, RNA, 
base composition) of the probe and nature of the target (DNA, RNA, base composition, 

30 . present in solution or immobilized, etc.) and the concentration of the salts and other 
components (e.g., the presence or absence of formamide, dextran sulfate, polyethylene 
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glycol) are considered and the hybridization solution may be varied to generate 
conditions of low stringency hybridization different from, but equivalent to, the above 
listed conditions. In addition, the art knows conditions that promote hybridization under 
conditions of high stringency (e.g., increasing the temperature of the hybridization and/or 
5 wash steps, the use of formamide in the hybridization solution, etc.). 

When used in reference to a double-stranded nucleic acid sequence such as a 
cDNA or genomic clone, the term "substantially homologous" refers to any probe that 
can hybridize to either or both strands of the double-stranded nucleic acid sequence under 
conditions of low stringency as described above. 

10 A gene may produce multiple RNA species that are generated by differential 

splicing of the primary RNA transcript. cDNAs that are splice variants of the same gene 
will contain regions of sequence identity or complete homology (representing the 
presence of the same exon or portion of the same exon on both cDNAs) and regions of 
complete non-identity (for example, representing the presence of exon "A" on cDNA 1 

15 wherein cDNA 2 contains exon "B" instead). Because the two cDNAs contain regions of 
sequence identity they will both hybridize to a probe derived from the entire gene or 
portions of the gene containing sequences found on both cDNAs; the two splice variants 
are therefore substantially homologous to such a probe and to each other. 

When used in reference to a single-stranded nucleic acid sequence, the term 

20 "substantially homologous" refers to any probe that can hybridize (i.e., it is the 
complement of) the single-stranded nucleic acid sequence under conditions of low 
stringency as described above. 

As used herein, the term "competes for binding" is used in reference to a first 
polypeptide with an activity which binds to the same substrate as does a second 

25 polypeptide with an activity, where the second polypeptide is a variant of the first 
polypeptide or a related or dissimilar polypeptide. The efficiency (e.g., kinetics or 
thermodynamics) of binding by the first polypeptide may be the same as or greater than 
or less than the efficiency substrate binding by the second polypeptide. For example, the 
equilibrium binding constant (Kq) for binding to the substrate may be different for the 

30 two polypeptides. The term "K m " as used herein refers to the Michaelis-Menton constant 
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for an enzyme and is defined as the concentration of the specific substrate at which a 
given enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction. 

As used herein, the term "hybridization" is used in reference to the pairing of 
complementary nucleic acids. Hybridization and the strength of hybridization (i.e., the 
5 strength of the association between the nucleic acids) is impacted by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions 
involved, the T m of the formed hybrid, and the G:C ratio within the nucleic acids. 

As used herein, the term "T m lf is used in reference to the "melting temperature." 
The melting temperature is the temperature at which a population of double-stranded 

10 nucleic acid molecules becomes half dissociated into single strands. The equation for 
calculating the T m of nucleic acids is well known in the art. As indicated by standard 
references, a simple estimate of the T m value may be calculated by the equation: T m = 
81.5 + 0.4 1(% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl (See e.g., 
Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid Hybridization 

15 [1985]). Other references include more sophisticated computations that take structural as 
well as sequence characteristics into account for the calculation of T m . 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds such as organic 
solvents, under which nucleic acid hybridizations are conducted. Those skilled in the art 

20 will recognize that "stringency" conditions may be altered by varying the parameters just 
described either individually or in concert. With "high stringency" conditions, nucleic, 
acid base pairing will occur only between nucleic acid fragments that have a high 
frequency of complementary base sequences (e.g., hybridization under "high stringency" 
conditions may occur between homologs with about 85-100% identity, preferably about 

25 70-100% identity). With medium stringency conditions, nucleic acid base pairing will 
occur between nucleic acids with an intermediate frequency of complementary base 
sequences (e.g., hybridization under "medium stringency" conditions may occur between 
homologs with about 50-70% identity). Thus, conditions of "weak" or "low" stringency 
are often required with nucleic acids that are derived from organisms that are genetically 

30 diverse, as the frequency of complementary sequences is usually less. 
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"High stringency conditions' 1 when used in reference to nucleic acid hybridization 
comprise conditions equivalent to binding or hybridization at 42°C in a solution 
consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PC>4 H 2 0 and 1.85 g/1 EDTA, pH 

adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 \ig/m\ denatured 
5 salmon sperm DNA followed by washing in a solution comprising 0.1X SSPE, 1.0% SDS 
at 42°C when a probe of about 500 nucleotides in length is employed. 

"Medium stringency conditions" when used in reference to nucleic acid 
hybridization comprise conditions equivalent to binding or hybridization at 42°C in a 
solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH 2 PC>4 H 2 0 and 1.85 g/1 

10 EDTA, pH adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ng/ml 
denatured salmon sperm DNA followed by washing in a solution comprising 1.0X SSPE, 
1.0% SDS at 42°C when a probe of about 500 nucleotides in length is employed. 

"Low stringency conditions" comprise conditions equivalent to binding or 
hybridization at 42°C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 

15 NaH 2 P0 4 H 2 0 and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5X 
Denhardt's reagent [SOX Denhardt's contains per 500 ml: 5 g Ficoll (Type 400, 
Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 ^ig/ml denatured salmon sperm DNA 
followed by washing in a solution comprising 5X SSPE, 0.1% SDS at 42°C when a probe 
of about 500 nucleotides in length is employed. The present invention is not limited to 

20 the hybridization of probes of about 500 nucleotides in length. The present invention 
contemplates the use of probes between approximately 10 nucleotides up to several 
thousand (e.g., at least 5000) nucleotides in length. 

One skilled in the relevant understands that stringency conditions may be altered 
for probes of other sizes (See e.g., Anderson and Young, Quantitative Filter 

25 Hybridization, in Nucleic Acid Hybridization [1985] and Sambrook et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, NY [1989]). 

The following terms are used to describe the sequence relationships between two 
or more polynucleotides: "reference sequence", "sequence identity", "percentage of 
sequence identity", and "substantial identity". A "reference sequence" is a defined 

30 sequence used as a basis for a sequence comparison; a reference sequence may be a 
subset of a larger sequence, for example, as a segment of a full-length cDNA sequence 
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given in a sequence listing or may comprise a complete gene sequence. Generally, a 
reference sequence is at least 20 nucleotides in length, frequently at least 25 nucleotides 
in length, and often at least 50 nucleotides in length. Since two polynucleotides may 
each (1) comprise a sequence (i.e., a portion of the complete polynucleotide sequence) 
5 that is similar between the two polynucleotides, and (2) may further comprise a sequence 
that is divergent between the two polynucleotides, sequence comparisons between two 
(or more) polynucleotides are typically performed by comparing sequences of the two 
polynucleotides over a "comparison window" to identify and compare local regions of 
sequence similarity. A "comparison window", as used herein, refers to a conceptual 

10 segment of at least 20 contiguous nucleotide positions wherein a polynucleotide sequence 
may be compared to a reference sequence of at least 20 contiguous nucleotides and 
wherein the portion of the polynucleotide sequence in the comparison window may 
comprise additions or deletions (i.e., gaps) of 20 percent or less as compared to the 
reference sequence (which does not comprise additions or deletions) for optimal 

15 alignment of the two sequences. Optimal alignment of sequences for aligning a 

comparison window may be conducted by the local homology algorithm of Smith and 
Waterman [Smith and Waterman, Adv. Appl Math. 2: 482 (1981)] by the homology 
alignment algorithm of Needleman and Wunsch [Needleman and Wunsch, J. Mol. Biol. 
48:443 (1970)], by the search for similarity method of Pearson and Lipman [Pearson and 

20 Lipman, Proc. Natl. Acad. Sci. (U.S.A.) 85:2444 (1988)], by computerized 

implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the 
Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 
Science Dr., Madison, Wis.), or by inspection, and the best alignment (i.e., resulting in 
the highest percentage of homology over the comparison window) generated by the 

25 various methods is selected. The term "sequence identity" means that two polynucleotide 
sequences are identical (i.e., on a nucleotide-by-nucleotide basis) over the window of 
comparison. The term "percentage of sequence identity" is calculated by comparing two 
optimally aligned sequences over the window of comparison, determining the number of 
positions at which the identical nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both 

30 sequences to yield the number of matched positions, dividing the number of matched 

positions by the total number of positions in the window of comparison (i.e., the window 
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size), and multiplying the result by 100 to yield the percentage of sequence identity. The 
terms "substantial identity 11 as used herein denotes a characteristic of a polynucleotide 
sequence, wherein the polynucleotide comprises a sequence that has at least 85 percent 
sequence identity, preferably at least 90 to 95 percent sequence identity, more usually at 
5 least 99 percent sequence identity as compared to a reference sequence over a 

comparison window of at least 20 nucleotide positions, frequently over a window of at 
least 25-50 nucleotides, wherein the percentage of sequence identity is calculated by 
comparing the reference sequence to the polynucleotide sequence which may include 
deletions or additions which total 20 percent or less of the reference sequence over the 

10 window of comparison. The reference sequence may be a subset of a larger sequence, for 
example, as a segment of the full-length sequences of the compositions claimed in the 
present invention (e.g., Cayman ataxia). 

As applied to polypeptides, the term "substantial identity" means that two peptide 
sequences, when optimally aligned, such as by the programs GAP or BESTFIT using 

15 default gap weights, share at least 80 percent sequence identity, preferably at least 90 
percent sequence identity, more preferably at least 95 percent sequence identity or more 
(e.g., 99 percent sequence identity). Preferably, residue positions that are not identical 
differ by conservative amino acid substitutions. Conservative amino acid substitutions 
refer to the interchangeability of residues having similar side chains. For example, a 

20 group of amino acids having aliphatic side chains is glycine, alanine, valine, leucine, and 
isoleucine; a group of amino acids having aliphatic-hydroxyl side chains is serine and 
threonine; a group of amino acids having amide-containing side chains is asparagine and 
glutamine; a group of amino acids having aromatic side chains is phenylalanine, tyrosine, 
and tryptophan; a group of amino acids having basic side chains is lysine, arginine, and 

25 histidine; and a group of amino acids having sulfur-containing side chains is cysteine and 
methionine. Preferred conservative amino acids substitution groups are: valine-leucine- 
isoleucine, phenylalanine-tyrosine, lysine-arginine, alanine-valine, and asparagine- 
glutamine. 

The term "fragment" as used herein refers to a polypeptide that has an amino- 
30 terminal and/or carboxy-terminal deletion as compared to the native protein, but where 
the remaining amino acid sequence is identical to the corresponding positions in the 
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amino acid sequence deduced from a full-length cDNA sequence. Fragments typically 
are at least 4 amino acids long, preferably at least 20 amino acids long, usually at least 50 
amino acids long or longer, and span the portion of the polypeptide required for 
intermolecular binding of the compositions (claimed in the present invention) with its 
5 various ligands and/or substrates. 

The term "polymorphic locus" is a locus present in a population that shows 
variation between members of the population (i.e., the most common allele has a 
frequency of less than 0.95). In contrast, a "monomorphic locus" is a genetic locus at 
little or no variations seen between members of the population (generally taken to be a 
10 locus at which the most common allele exceeds a frequency of 0.95 in the gene pool of 
the population). 

As used herein, the term "genetic variation information" or "genetic variant 
information" refers to the presence or absence of one or more variant nucleic acid 
sequences (e.g., polymorphism or mutations) in a given allele of a particular gene (e.g., 

1 5 the Cayman ataxia gene). 

As used herein, the term "detection assay" refers to an assay for detecting the 
presence of absence of variant nucleic acid sequences (e.g., polymorphism or mutations) 
in a given allele of a particular gene (e.g., the Cayman ataxia gene). Examples of suitable 
detection assays include, but are not limited to, those described below in Section III B. 

20 The term "naturally-occurring" as used herein as applied to an object refers to the 

fact that an object can be found in nature. For example, a polypeptide or polynucleotide 
sequence that is present in an organism (including viruses) that can be isolated from a 
source in nature and which has not been intentionally modified by man in the laboratory 
is naturally-occurring. 

25 "Amplification" is a special case of nucleic acid replication involving template 

specificity. It is to be contrasted with non-specific template replication (i.e., replication 
that is template-dependent but not dependent on a specific template). Template 
specificity is here distinguished from fidelity of replication (i.e., synthesis of the proper 
polynucleotide sequence) and nucleotide (ribo- or deoxyribo-) specificity. Template 

30 specificity is frequently described in terms of "target" specificity. Target sequences are 
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"targets" in the sense that they are sought to be sorted out from other nucleic acid. 
Amplification techniques have been designed primarily for this sorting out. 

Template specificity is achieved in most amplification techniques by the choice of 
enzyme. Amplification enzymes are enzymes that, under conditions they are used, will 
5 process only specific sequences of nucleic acid in a heterogeneous mixture of nucleic 
acid. For example, in the case of Qp replicase, MDV-1 RNA is the specific template for 
the replicase (D.L. Kacian et ai 9 Proc. Natl. Acad. Sci. USA 69:3038 [1972]). Other 
nucleic acid will not be replicated by this amplification enzyme. Similarly, in the case of 
T7 RNA polymerase, this amplification enzyme has a stringent specificity for its own 

1 0 promoters (Chamberlin et al , Nature 228 :227 [ 1 970]). In the case of T4 DNA ligase, the 
enzyme will not ligate the two oligonucleotides or polynucleotides, where there is a 
mismatch between the oligonucleotide or polynucleotide substrate and the template at the 
ligation junction (D.Y. Wu and R. B. Wallace, Genomics 4:560 [1989]). Finally, Taq 
and Pfu polymerases, by virtue of their ability to function at high temperature, are found 

15 to display high specificity for the sequences bounded and thus defined by the primers; the 
high temperature results in thermodynamic conditions that favor primer hybridization 
with the target sequences and not hybridization with non-target sequences (H. A. Erlich 
(ed.), PCR Technology, Stockton Press [1989]). 

As used herein, the term "amplifiable nucleic acid" is used in reference to nucleic 

20 acids that may be amplified by any amplification method. It is contemplated that 
"amplifiable nucleic acid" will usually comprise "sample template." 

As used herein, the term "sample template" refers to nucleic acid originating from 
a sample that is analyzed for the presence of "target" (defined below). In contrast, 
"background template" is used in reference to nucleic acid other than sample template 

25 that may or may not be present in a sample. Background template is most often 

inadvertent. It may be the result of carryover, or it may be due to the presence of nucleic 
acid contaminants sought to be purified away from the sample. For example, nucleic 
acids from organisms other than those to be detected may be present as background in a 
test sample. 

30 As used herein, the term "primer" refers to an oligonucleotide, whether occurring 

naturally as in a purified restriction digest or produced synthetically, which is capable of 
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acting as a point of initiation of synthesis when placed under conditions in which 
synthesis of a primer extension product which is complementary to a nucleic acid strand 
is induced, (i.e., in the presence of nucleotides and an inducing agent such as DNA 
polymerase and at a suitable temperature and pH). The primer is preferably single 
5 stranded for maximum efficiency in amplification, but may alternatively be double 

stranded. If double stranded, the primer is first treated to separate its strands before being 
used to prepare extension products. Preferably, the primer is an 

oligodeoxyribonucleotide. The primer must be sufficiently long to prime the synthesis of 
extension products in the presence of the inducing agent. The exact lengths of the 
10 primers will depend on many factors, including temperature, source of primer and the use 
of the method. 

As used herein, the term "probe" refers to an oligonucleotide (i.e., a sequence of 
nucleotides), whether occurring naturally as in a purified restriction digest or produced 
synthetically, recombinantly or by PCR amplification, that is capable of hybridizing to 

1 5 another oligonucleotide of interest. A probe may be single-stranded or double-stranded. 
Probes are useful in the detection, identification and isolation of particular gene 
sequences. It is contemplated that any probe used in the present invention will be labeled 
with any "reporter molecule," so that is detectable in any detection system, including, but 
not limited to enzyme (e.g., ELISA, as well as enzyme-based histochemical assays), 

20 fluorescent, radioactive, and luminescent systems. It is not intended that the present 
invention be limited to any particular detection system or label. 

As used herein, the term "target," refers to a nucleic acid sequence or structure to 
be detected or characterized. Thus, the "target" is sought to be sorted out from other 
nucleic acid sequences. A "segment" is defined as a region of nucleic acid within the 

25 target sequence. 

As used herein, the term "polymerase chain reaction" ("PCR") refers to the 
method of K.B. Mullis U.S. Patent Nos. 4,683,195, 4,683,202, and 4,965,188, hereby 
incorporated by reference, that describe a method for increasing the concentration of a 
segment of a target sequence in a mixture of genomic DNA without cloning or 

30 purification. This process for amplifying the target sequence consists of introducing a 
large excess of two oligonucleotide primers to the DNA mixture containing the desired 
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target sequence, followed by a precise sequence of thermal cycling in the presence of a 
DNA polymerase. The two primers are complementary to their respective strands of the 
double stranded target sequence. To effect amplification, the mixture is denatured and 
the primers then annealed to their complementary sequences within the target molecule. 
5 Following annealing, the primers are extended with a polymerase so as to form a new 
pair of complementary strands. The steps of denaturation, primer annealing, and 
polymerase extension can be repeated many times (i.e. 9 denaturation, annealing and 
extension constitute one "cycle"; there can be numerous "cycles") to obtain a high 
concentration of an amplified segment of the desired target sequence. The length of the 

10 amplified segment of the desired target sequence is determined by the relative positions 
of the primers with respect to each other, and therefore, this length is a controllable 
parameter. By virtue of the repeating aspect of the process, the method is referred to as 
the "polymerase chain reaction" (hereinafter "PCR"). Because the desired amplified 
segments of the target sequence become the predominant sequences (in terms of 

15 concentration) in the mixture, they are said to be "PCR amplified." 

With PCR, it is possible to amplify a single copy of a specific target sequence in 
genomic DNA to a level detectable by several different methodologies (e.g., 
hybridization with a labeled probe; incorporation of biotinylated primers followed by 
avidin-enzyme conjugate detection; incorporation of 32p-i a beled deoxynucleotide 

20 triphosphates, such as dCTP or dATP, into the amplified segment). In addition to 

genomic DNA, any oligonucleotide or polynucleotide sequence can be amplified with the 
appropriate set of primer molecules. In particular, the amplified segments created by the 
PCR process itself are, themselves, efficient templates for subsequent PCR 
amplifications. 

25 As used herein, the terms "PCR product," "PCR fragment," and "amplification 

product" refer to the resultant mixture of compounds after two or more cycles of the PCR 
steps of denaturation, annealing and extension are complete. These terms encompass the 
case where there has been amplification of one or more segments of one or more target 
sequences. 

30 As used herein, the term "amplification reagents" refers to those reagents 

(deoxyribonucleotide triphosphates, buffer, etc.), needed for amplification except for 
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primers, nucleic acid template, and the amplification enzyme. Typically, amplification 
reagents along with other reaction components are placed and contained in a reaction 
vessel (test tube, microwell, etc.). 

As used herein, the terms "restriction endonucleases" and "restriction enzymes" 
5 refer to bacterial enzymes, each of which cut double-stranded DNA at or near a specific 
nucleotide sequence. 

As used herein, the term "recombinant DNA molecule" as used herein refers to a 
DNA molecule that is comprised of segments of DNA joined together by means of 
molecular biological techniques. 

10 As used herein, the term "antisense" is used in reference to RNA sequences that 

are complementary to a specific RNA sequence (e.g., mRNA). Included within this 
definition are antisense RNA ("asRNA") molecules involved in gene regulation by 
bacteria. Antisense RNA may be produced by any method, including synthesis by 
splicing the gene(s) of interest in a reverse orientation to a viral promoter that permits the 

15 synthesis of a coding strand. Once introduced into an embryo, this transcribed strand 

combines with natural mRNA produced by the embryo to form duplexes. These duplexes 
then block either the further transcription of the mRNA or its translation. In this manner, 
mutant phenotypes may be generated. The term "antisense strand" is used in reference to 
a nucleic acid strand that is complementary to the "sense" strand. The designation (-) 

20 (i.e., "negative") is sometimes used in reference to the antisense strand, with the 
designation (+) sometimes used in reference to the sense (i.e., "positive") strand. 

The term "isolated" when used in relation to a nucleic acid, as in "an isolated 
oligonucleotide" or "isolated polynucleotide" refers to a nucleic acid sequence that is 
identified and separated from at least one contaminant nucleic acid with which it is 

25 ordinarily associated in its natural source. Isolated nucleic acid is present in a form or 
setting that is different from that in which it is found in nature. In contrast, non-isolated 
nucleic acids are nucleic acids such as DNA and RNA found in the state they exist in 
nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell 
chromosome in proximity to neighboring genes; RNA sequences, such as a specific 

30 mRNA sequence encoding a specific protein, are found in the cell as a mixture with 

numerous other mRNAs that encode a multitude of proteins. However, isolated nucleic 



19 



acid encoding Cayman ataxia includes, by way of example, such nucleic acid in cells 
ordinarily expressing Cayman ataxia where the nucleic acid is in a chromosomal location 
different from that of natural cells, or is otherwise flanked by a different nucleic acid 
sequence than that found in nature. The isolated nucleic acid, oligonucleotide, or 
5 polynucleotide may be present in single-stranded or double-stranded form. When an 
isolated nucleic acid, oligonucleotide or polynucleotide is to be utilized to express a 
protein, the oligonucleotide or polynucleotide will contain at a minimum the sense or 
coding strand {i.e., the oligonucleotide or polynucleotide may single-stranded), but may 
contain both the sense and anti-sense strands {i.e., the oligonucleotide or polynucleotide 

1 0 may be double-stranded). 

As used herein, a "portion of a chromosome" refers to a discrete section of the 
chromosome. Chromosomes are divided into sites or sections by cytogeneticists as 
follows: the short (relative to the centromere) arm of a chromosome is termed the "p" 
arm; the long arm is termed the "q" arm. Each arm is then divided into 2 regions termed 

15 region 1 and region 2 (region 1 is closest to the centromere). Each region is further 

divided into bands. The bands may be further divided into sub-bands. For example, the 
1 lpl5.5 portion of human chromosome 1 1 is the portion located on chromosome 1 1 (1 1) 
on the short arm (p) in the first region (1) in the 5th band (5) in sub-band 5 (.5). A 
portion of a chromosome may be "altered;" for instance the entire portion may be absent 

20 due to a deletion or may be rearranged {e.g., inversions, translocations, expanded or 
contracted due to changes in repeat regions). In the case of a deletion, an attempt to 
hybridize {i.e., specifically bind) a probe homologous to a particular portion of a 
chromosome could result in a negative result {i.e., the probe could not bind to the sample 
containing genetic material suspected of containing the missing portion of the 

25 chromosome). Thus, hybridization of a probe homologous to a particular portion of a 
chromosome may be used to detect alterations in a portion of a chromosome. 

The term "sequences associated with a chromosome" means preparations of 
chromosomes {e.g., spreads of metaphase chromosomes), nucleic acid extracted from a 
sample containing chromosomal DNA {e.g. , preparations of genomic DNA); the RNA 

30 that is produced by transcription of genes located on a chromosome {e.g., hnRNA and 
mRNA), and cDNA copies of the RNA transcribed from the DNA located on a 
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chromosome. Sequences associated with a chromosome may be detected by numerous 
techniques including probing of Southern and Northern blots and in situ hybridization to 
RNA, DNA, or metaphase chromosomes with probes containing sequences homologous 
to the nucleic acids in the above listed preparations. 
5 As used herein the term "portion" when in reference to a nucleotide sequence (as 

in "a portion of a given nucleotide sequence") refers to fragments of that sequence. The 
fragments may range in size from four nucleotides to the entire nucleotide sequence 
minus one nucleotide (10 nucleotides, 20, 30, 40, 50, 100, 200, etc.). 

As used herein the term "coding region" when used in reference to structural gene 

10 refers to the nucleotide sequences that encode the amino acids found in the nascent 
polypeptide as a result of translation of a mRNA molecule. The coding region is 
bounded, in eukaryotes, on the 5' side by the nucleotide triplet "ATG" that encodes the 
initiator methionine and on the 3' side by one of the three triplets, which specify stop 
codons {i.e., TAA, TAG, TGA). 

15 As used herein, the term "purified" or "to purify" refers to the removal of 

contaminants from a sample. For example, antibodies to Cayman ataxia protein are 
purified by removal of contaminating non-immunoglobulin proteins; they are also 
purified by the removal of immunoglobulin that does not bind Cayman ataxia. The 
removal of non-immunoglobulin proteins and/or the removal of immunoglobulins that do 

20 not bind Cayman ataxia results in an increase in the percent of Cayman ataxia-reactive 
immunoglobulins in the sample. In another example, recombinant Cayman ataxia 
polypeptides are expressed in bacterial host cells and the polypeptides are purified by the 
removal of host cell proteins; the percent of recombinant Cayman ataxia polypeptides is 
thereby increased in the sample. 

25 The term "recombinant DNA molecule" as used herein refers to a DNA molecule 

that is comprised of segments of DNA joined together by means of molecular biological 
techniques. 

The term "recombinant protein" or "recombinant polypeptide" as used herein 
refers to a protein molecule that is expressed from a recombinant DNA molecule. 
30 The term "native protein" as used herein to indicate that a protein does not contain 

amino acid residues encoded by vector sequences; that is the native protein contains only 
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those amino acids found in the protein as it occurs in nature. A native protein may be 
produced by recombinant means or may be isolated from a naturally occurring source. 

As used herein the term "portion" when in reference to a protein (as in "a portion 
of a given protein") refers to fragments of that protein. The fragments may range in size 
5 from four consecutive amino acid residues to the entire amino acid sequence minus one 
amino acid. 

The term "Southern blot," refers to the analysis of DNA on agarose or acrylamide 
gels to fractionate the DNA according to size followed by transfer of the DNA from the 
gel to a solid support, such as nitrocellulose or a nylon membrane. The immobilized 

1 0 DNA is then probed with a labeled probe to detect DNA species complementary to the 
probe used. The DNA may be cleaved with restriction enzymes prior to electrophoresis. 
Following electrophoresis, the DNA may be partially depurinated and denatured prior to 
or during transfer to the solid support. Southern blots are a standard tool of molecular 
biologists (J. Sambrook et al t Molecular Cloning: A Laboratory Manual, Cold Spring 

15 Harbor Press, NY, pp 9.31-9.58 [1989]). 

The term "Northern blot," as used herein refers to the analysis of RNA by 
electrophoresis of RNA on agarose gels to fractionate the RNA according to size 
followed by transfer of the RNA from the gel to a solid support, such as nitrocellulose or 
a nylon membrane. The immobilized RNA is then probed with a labeled probe to detect 

20 RNA species complementary to the probe used. Northern blots are a standard tool of 
molecular biologists (J. Sambrook, et al., supra, pp 7.39-7.52 [1989]). 

The term "Western blot" refers to the analysis of protein(s) (or polypeptides) 
immobilized onto a support such as nitrocellulose or a membrane. The proteins are run 
on acrylamide gels to separate the proteins, followed by transfer of the protein from the 

25 gel to a solid support, such as nitrocellulose or a nylon membrane. The immobilized 

proteins are then exposed to antibodies with reactivity against an antigen of interest. The 
binding of the antibodies may be detected by various methods, including the use of 
radiolabeled antibodies. 

The term "antigenic determinant" as used herein refers to that portion of an 

30 antigen that makes contact with a particular antibody (z.e., an epitope). When a protein or 
fragment of a protein is used to immunize a host animal, numerous regions of the protein 
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may induce the production of antibodies that bind specifically to a given region or three- 
dimensional structure on the protein; these regions or structures are referred to as 
antigenic determinants. An antigenic determinant may compete with the intact antigen 
(i.e., the "immunogen" used to elicit the immune response) for binding to an antibody. 
5 The term "transgene" as used herein refers to a foreign, heterologous, or 

autologous gene that is placed into an organism by introducing the gene into newly 
fertilized eggs or early embryos. The term "foreign gene" refers to any nucleic acid (e.g., 
gene sequence) that is introduced into the genome of an animal by experimental 
manipulations and may include gene sequences found in that animal so long as the 

10 introduced gene does not reside in the same location as does the naturally-occurring gene. 
The term "autologous gene" is intended to encompass variants (e.g. , polymorphisms or 
mutants) of the naturally occurring gene. The term transgene thus encompasses the 
replacement of the naturally occurring gene with a variant form of the gene. 

As used herein, the term "vector" is used in reference to nucleic acid molecules 

15 that transfer DNA segment(s) from one cell to another. The term "vehicle" is sometimes 
used interchangeably with "vector." 

The term "expression vector" as used herein refers to a recombinant DNA 
molecule containing a desired coding sequence and appropriate nucleic acid sequences 
necessary for the expression of the operably linked coding sequence in a particular host 

20 organism. Nucleic acid sequences necessary for expression in prokaryotes usually 

include a promoter, an operator (optional), and a ribosome binding site, often along with 
other sequences. Eukaryotic cells are known to utilize promoters, enhancers, and 
termination and polyadenylation signals. 

As used herein, the term "host cell" refers to any eukaryotic or prokaryotic cell 

25 (e.g., bacterial cells such as E. coli, yeast cells, mammalian cells, avian cells, amphibian 
cells, plant cells, fish cells, and insect cells), whether located in vitro or in vivo. For 
example, host cells may be located in a transgenic animal. 

The terms "overexpression" and "overexpressing" and grammatical equivalents, 
are used in reference to levels of mRNA to indicate a level of expression approximately 

30 3-fold higher than that typically observed in a given tissue in a control or non-transgenic 
animal. Levels of mRNA are measured using any of a number of techniques known to 
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those skilled in the art including, but not limited to Northern blot analysis (See, Example 
10, for a protocol for performing Northern blot analysis). Appropriate controls are 
included on the Northern blot to control for differences in the amount of RNA loaded 
from each tissue analyzed (e.g., the amount of 28S rRNA, an abundant RNA transcript 
5 present at essentially the same amount in all tissues, present in each sample can be used 
as a means of normalizing or standardizing the RAD50 mRNA-specific signal observed 
on Northern blots). The amount of mRNA present in the band corresponding in size to 
the correctly spliced Cayman ataxia transgene RNA is quantified; other minor species of 
RNA which hybridize to the transgene probe are not considered in the quantification of 
1 0 the expression of the transgenic mRNA. 

The term "transfection" as used herein refers to the introduction of foreign DNA 
into eukaryotic cells. Transfection may be accomplished by a variety of means known to 
the art including calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated 
transfection, polybrene-mediated transfection, electroporation, microinjection, liposome 
15 fusion, lipofection, protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfection" or "stably transfected" refers to the introduction 
and integration of foreign DNA into the genome of the transfected cell. The term "stable 
transfectant" refers to a cell that has stably integrated foreign DNA into the genomic 
DNA. 

20 The term "transient transfection" or "transiently transfected" refers to the 

introduction of foreign DNA into a cell where the foreign DNA fails to integrate into the 
genome of the transfected cell. The foreign DNA persists in the nucleus of the 
transfected cell for several days. During this time the foreign DNA is subject to the 
regulatory controls that govern the expression of endogenous genes in the chromosomes. 

25 The term "transient transfectant" refers to cells that have taken up foreign DNA but have 
failed to integrate this DNA. 

The term "calcium phosphate co-precipitation" refers to a technique for the 
introduction of nucleic acids into a cell. The uptake of nucleic acids by cells is enhanced 
when the nucleic acid is presented as a calcium phosphate-nucleic acid co-precipitate. 

30 The original technique of Graham and van der Eb (Graham and van der Eb, Virol., 



24 



52:456 [1973]), has been modified by several groups to optimize conditions for particular 
types of cells. The art is well aware of these numerous modifications. 

A "composition comprising a given polynucleotide sequence" as used herein 
refers broadly to any composition containing the given polynucleotide sequence. The 
5 composition may comprise an aqueous solution. Compositions comprising 

polynucleotide sequences encoding Cayman ataxia (e.g., SEQ ED NOs:3 or 1 1) or 
fragments thereof may be employed as hybridization probes. In this case, the Cayman 
ataxia encoding polynucleotide sequences are typically employed in an aqueous solution 
containing salts (e.g., NaCl), detergents (e.g., SDS), and other components (e.g., 

10 Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

The term "test compound" refers to any chemical entity, pharmaceutical, drug, 
and the like that can be used to treat or prevent a disease, illness, sickness, or disorder of 
bodily function, or otherwise alter the physiological or cellular status of a sample. Test 
compounds comprise both known and potential therapeutic compounds. A test 

15 compound can be determined to be therapeutic by screening using the screening methods 
of the present invention. A "known therapeutic compound" refers to a therapeutic 
compound that has been shown (e.g., through animal trials or prior experience with 
administration to humans) to be effective in such treatment or prevention. 

As used herein, the term "sample" is used in its broadest sense. In one sense, it is 

20 meant to include a specimen or culture obtained from any source, as well as biological 
and environmental samples. Biological samples may be obtained from animals 
(including humans) and encompass fluids, solids, tissues, and gases (e.g., saliva, blood, 
urine, and tissue biopsies). Biological samples include blood products, such as plasma, 
serum and the like. Environmental samples include environmental material such as 

25 surface matter, soil, water, crystals and industrial samples. Such examples are not 

however to be construed as limiting the sample types applicable to the present invention. 

As used herein, the term "response," when used in reference to an assay, refers to 
the generation of a detectable signal (e.g., accumulation of reporter protein, increase in 
ion concentration, accumulation of a detectable chemical product). 

30 As used herein, the term "reporter gene" refers to a gene encoding a protein that 

may be assayed. Examples of reporter genes include, but are not limited to, luciferase 
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(See, e.g., deWet et ai, Mol. Cell. Biol. 7:725 [1987] and U.S. Pat Nos., 6,074,859; 
5,976,796; 5,674,713; and 5,618,682; all of which are incorporated herein by reference), 
green fluorescent protein (e.g., GenBank Accession Number U43284; a number of GFP 
variants are commercially available from CLONTECH Laboratories, Palo Alto, CA), 
5 chloramphenicol acetyltransferase, P-galactosidase, alkaline phosphatase, and horse 
radish peroxidase. 

As used herein, the terms "computer memory" and "computer memory device" 
refer to any storage media readable by a computer processor. Examples of computer 
memory include, but are not limited to, RAM, ROM, computer chips, digital video disc 

10 (DVDs), compact discs (CDs), hard disk drives (HDD), and magnetic tape. 

As used herein, the term "computer readable medium" refers to any device or 
system for storing and providing information (e.g., data and instructions) to a computer 
processor. Examples of computer readable media include, but are not limited to, DVDs, 
CDs, hard disk drives, magnetic tape and servers for streaming media over networks. 

15 As used herein, the term "entering" as in "entering said genetic variation 

information into said computer" refers to transferring information to a "computer 
readable medium." Information may be transferred by any suitable method, including but 
not limited to, manually (e.g., by typing into a computer) or automated (e.g., transferred 
from another "computer readable medium" via a "processor"). 
J20 As used herein, the terms "processor" and "central processing unit" or "CPU" are 

used interchangeably and refer to a device that is able to read a program from a computer 
memory (e.g., ROM or other computer memory) and perform a set of steps according to 
the program. 

As used herein, the term "computer implemented method" refers to a method 
25 utilizing a "CPU" and "computer readable medium." 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to ataxia, in particular to ataxia protein and nucleic 
acids encoding ataxia proteins. The present invention provides assays for the detection of 
30 ataxia and ataxia polymorphisms and mutations associated with disease states. 
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I. Cayman ataxia Polynucleotides 

The present invention provides a new gene associated with Caymans ataxia. 
Accordingly, the present invention provides nucleic acids encoding Cayman ataxia genes, 
homologs, variants (e.g., polymorphisms and mutants), including but not limited to, those 
5 described in SEQ ID NOs: 3, 1 1, 8, and 10. In some embodiments, the present invention 
provide polynucleotide sequences that are capable of hybridizing to SEQ ED NOs: 3, 1 1, 
8, and 10 under conditions of low to high stringency as long as the polynucleotide 
sequence capable of hybridizing encodes a protein that retains a biological activity of the 
naturally occurring Cayman ataxia. In some embodiments, the protein that retains a 

10 biological activity of naturally occurring Cayman ataxia is 70% homologous to wild-type 
Cayman ataxia, preferably 80% homologous to wild-type Cayman ataxia, more 
preferably 90% homologous to wild-type Cayman ataxia, and most preferably 95% 
homologous to wild-type Cayman ataxia. In preferred embodiments, hybridization 
conditions are based on the melting temperature (T m ) of the nucleic acid binding 

15 complex and confer a defined "stringency" as explained above (See e.g., Wahl, et al. 9 
Meth. Enzymol., 152:399-407 [1987], incorporated herein by reference). 

In other embodiments of the present invention, additional alleles of Cayman 
ataxia are provided. In preferred embodiments, alleles result from a polymorphism or 
mutation (i.e., a change in the nucleic acid sequence) and generally produce altered 

20 mRNAs or polypeptides whose structure or function may or may not be altered. Any 

given gene may have none, one or many allelic forms. Common mutational changes that 
give rise to alleles are generally ascribed to deletions, additions or substitutions of nucleic 
acids. Each of these types of changes may occur alone, or in combination with the 
others, and at the rate of one or more times in a given sequence. Examples of the alleles 

25 of the present invention include those encoded by SEQ ID NOs:3 and 1 1 (wild type) and 
disease alleles (SEQ ID NOs: 8 and 10; See Experimental Section below). 

In still other embodiments of the present invention, the nucleotide sequences of 
the present invention may be engineered in order to alter an Cayman ataxia coding 
sequence for a variety of reasons, including but not limited to, alterations which modify 

30 the cloning, processing and/or expression of the gene product. For example, mutations 
may be introduced using techniques that are well known in the art (e.g, site-directed 
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mutagenesis to insert new restriction sites, to alter glycosylation patterns, to change 
codon preference, etc.). 

In some embodiments of the present invention, the polynucleotide sequence of 
Cayman ataxia may be extended utilizing the nucleotide sequence (e.g., SEQ ID NO: 3 or 
5 1 1) in various methods known in the art to detect upstream sequences such as promoters 
and regulatory elements. For example, it is contemplated that restriction-site polymerase 
chain reaction (PCR) will find use in the present invention. This is a direct method that 
uses universal primers to retrieve unknown sequence adjacent to a known locus 
(Gobinda et al., PCR Methods Applic, 2:318-22 [1993]). First, genomic DNA is 

10 amplified in the presence of a primer to a linker sequence and a primer specific to the 
known region. The amplified sequences are then subjected to a second round of PCR 
with the same linker primer and another specific primer internal to the first one. Products 
of each round of PCR are transcribed with an appropriate RNA polymerase and 
sequenced using reverse transcriptase. 

15 In another embodiment, inverse PCR can be used to amplify or extend sequences 

using divergent primers based on a known region (Triglia et ai, Nucleic Acids Res., 
16:8186 [1988]). The primers may be designed using Oligo 4.0 (National Biosciences 
Inc, Plymouth Minn.), or another appropriate program, to be 22-30 nucleotides in length, 
to have a GC content of 50% or more, and to anneal to the target sequence at 

20 temperatures about 68-72°C. The method uses several restriction enzymes to generate a 

* 

suitable fragment in the known region of a gene. The fragment is then circularized by 
intramolecular ligation and used as a PCR template. In still other embodiments, walking 
PCR is utilized. Walking PCR is a method for targeted gene walking that permits 
retrieval of unknown sequence (Parker et aL 9 Nucleic Acids Res., 19:3055-60 [1991]). 

25 The PROMOTERFINDER kit (Clontech) uses PCR, nested primers and special libraries 
to "walk in" genomic DNA. This process avoids the need to screen libraries and is useful 
in finding intron/exon junctions. 

Preferred libraries for screening for full length cDNAs include mammalian 
libraries that have been size-selected to include larger cDNAs. Also, random primed 

30 libraries are preferred, in that they will contain more sequences that contain the 5 1 and 
upstream gene regions. A randomly primed library may be particularly useful in case 
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where an oligo d(T) library does not yield full-length cDNA. Genomic mammalian 
libraries are useful for obtaining introns and extending 5' sequence. 

In other embodiments of the present invention, variants of the disclosed Cayman 
ataxia sequences are provided. In preferred embodiments, variants result from 
5 polymorphisms or mutations (i.e., a change in the nucleic acid sequence) and generally 
produce altered mRNAs or polypeptides whose structure or function may or may not be 
altered. Any given gene may have none, one, or many variant forms. Common 
mutational changes that give rise to variants are generally ascribed to deletions, additions 
or substitutions of nucleic acids. Each of these types of changes may occur alone, or in 

10 combination with the others, and at the rate of one or more times in a given sequence. 

It is contemplated that it is possible to modify the structure of a peptide having a 
function (e.g., Cayman ataxia function) for such purposes as altering the biological 
activity (e.g., prevention of disease). Such modified peptides are considered functional 
equivalents of peptides having an activity of Cayman ataxia as defined herein. A 

1 5 modified peptide can be produced in which the nucleotide sequence encoding the 
polypeptide has been altered, such as by substitution, deletion, or addition. In 
particularly preferred embodiments, these modifications do not significantly reduce the 
biological activity of the modified Cayman ataxia. In other words, construct "X" can be 
evaluated in order to determine whether it is a member of the genus of modified or 

20 variant Cayman ataxia's of the present invention as defined functionally, rather than 
structurally. In preferred embodiments, the activity of variant Cayman ataxia 
polypeptides is evaluated by methods described herein (e.g., the generation of transgenic 
animals). 

Moreover, as described above, variant forms of Cayman ataxia are also 
25 contemplated as being equivalent to those peptides and DNA molecules that are set forth 
in more detail herein. For example, it is contemplated that isolated replacement of a 
leucine with an isoleucine or valine, an aspartate with a glutamate, a threonine with a 
serine, or a similar replacement of an amino acid with a structurally related amino acid 
(i.e., conservative mutations) will not have a major effect on the biological activity of the 
30 resulting molecule. Accordingly, some embodiments of the present invention provide 
variants of Cayman ataxia disclosed herein containing conservative replacements. 
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Conservative replacements are those that take place within a family of amino acids that 
are related in their side chains. Genetically encoded amino acids can be divided into four 
families: (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine, histidine); (3) 
nonpolar (alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
5 tryptophan); and (4) uncharged polar (glycine, asparagine, glutamine, cysteine, serine, 
threonine, tyrosine). Phenylalanine, tryptophan, and tyrosine are sometimes classified 
jointly as aromatic amino acids. In similar fashion, the amino acid repertoire can be 
grouped as (1) acidic (aspartate, glutamate); (2) basic (lysine, arginine, histidine), (3) 
aliphatic (glycine, alanine, valine, leucine, isoleucine, serine, threonine), with serine and 

10 threonine optionally be grouped separately as aliphatic-hydroxyl; (4) aromatic 

(phenylalanine, tyrosine, tryptophan); (5) amide (asparagine, glutamine); and (6) sulfur 
-containing (cysteine and methionine) (e.g., Stryer ed., Biochemistry, pg. 17-21, 2nd ed, 
WH Freeman and Co., 1981). Whether a change in the amino acid sequence of a peptide 
results in a functional polypeptide can be readily determined by assessing the ability of 

15 the variant peptide to function in a fashion similar to the wild-type protein. Peptides 
having more than one replacement can readily be tested in the same manner. 

More rarely, a variant includes "nonconservative" changes (e.g., replacement of a 
glycine with a tryptophan). Analogous minor variations can also include amino acid 
deletions or insertions, or both. Guidance in determining which amino acid residues can 

20 be substituted, inserted, or deleted without abolishing biological activity can be found 
using computer programs (e.g., LASERGENE software, DNASTAR Inc., Madison, 
Wis.). 

As described in more detail below, variants may be produced by methods such as 
directed evolution or other techniques for producing combinatorial libraries of variants, 

25 described in more detail below. In still other embodiments of the present invention, the 
nucleotide sequences of the present invention may be engineered in order to alter a 
Cayman ataxia coding sequence including, but not limited to, alterations that modify the 
cloning, processing, localization, secretion, and/or expression of the gene product. For 
example, mutations may be introduced using techniques that are well known in the art 

30 (e.g., site-directed mutagenesis to insert new restriction sites, alter glycosylation patterns, 
or change codon preference, etc.). 
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II. Cayman ataxia Polypeptides 

In other embodiments, the present invention provides Cayman ataxia 
polynucleotide sequences that encode Cayman ataxia polypeptide sequences. Cayman 
5 ataxia polypeptides (e.g., SEQ ID NOs: 4 and 9) are described in Figures 4 and 9. Other 
embodiments of the present invention provide fragments, fusion proteins or functional 
equivalents of these Cayman ataxia proteins. In some embodiments, the present 
invention provides truncation mutants of Cayman ataxia. In still other embodiment of the 
present invention, nucleic acid sequences corresponding to Cayman ataxia variants, 

10 homologs, and mutants may be used to generate recombinant DNA molecules that direct 
the expression of the Cayman ataxia variants, homologs, and mutants in appropriate host 
cells. In some embodiments of the present invention, the polypeptide may be a naturally 
purified product, in other embodiments it may be a product of chemical synthetic 
procedures, and in still other embodiments it may be produced by recombinant 

15 techniques using a prokaryotic or eukaryotic host (e.g., by bacterial, yeast, higher plant, 
insect and mammalian cells in culture). In some embodiments, depending upon the host 
employed in a recombinant production procedure, the polypeptide of the present 
invention may be glycosylated or may be non-glycosylated. In other embodiments, the 
polypeptides of the invention may also include an initial methionine amino acid residue. 

20 In one embodiment of the present invention, due to the inherent degeneracy of the 

genetic code, DNA sequences other than the polynucleotide sequences of SEQ ID NOs: 
3, 1 1, 8, and 10 that encode substantially the same or a functionally equivalent amino 
acid sequence, may be used to clone and express Cayman ataxia. In general, such 
polynucleotide sequences hybridize to SEQ ID NOs: 3, 1 1, 8, and 10 under conditions of 

25 high to medium stringency as described above. As will be understood by those of skill in 
the art, it may be advantageous to produce Cayman ataxia-encoding nucleotide sequences 
possessing non-naturally occurring codons. Therefore, in some preferred embodiments, 
codons preferred by a particular prokaryotic or eukaryotic host (Murray et al. 9 Nucl. 
Acids Res., 17 [1989]) are selected, for example, to increase the rate of Cayman ataxia 

30 expression or to produce recombinant RNA transcripts having desirable properties, such 
as a longer half-life, than transcripts produced from naturally occurring sequence. 
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1. Vectors for Production of Cayman ataxia 

The polynucleotides of the present invention may be employed for producing 
polypeptides by recombinant techniques. Thus, for example, the polynucleotide may be 
5 included in any one of a variety of expression vectors for expressing a polypeptide. In 
some embodiments of the present invention, vectors include, but are not limited to, 
chromosomal, nonchromosomal and synthetic DNA sequences (e.g., derivatives of SV40, 
bacterial plasmids, phage DNA; baculovirus, yeast plasmids, vectors derived from 
combinations of plasmids and phage DNA, and viral DNA such as vaccinia, adenovirus, 

10 fowl pox virus, and pseudorabies). It is contemplated that any vector may be used as 
long as it is replicable and viable in the host. 

In particular, some embodiments of the present invention provide recombinant 
constructs comprising one or more of the sequences as broadly described above (e.g., 
SEQ ID NOs: 3, 1 1, 8, and 10). In some embodiments of the present invention, the 

15 constructs comprise a vector, such as a plasmid or viral vector, into which a sequence of 
the invention has been inserted, in a forward or reverse orientation. In still other 
embodiments, the heterologous structural sequence (e.g., SEQ ID NO:3) is assembled in 
appropriate phase with translation initiation and termination sequences. In preferred 
embodiments of the present invention, the appropriate DNA sequence is inserted into the 

20 vector using any of a variety of procedures. In general, the DNA sequence is inserted 
into an appropriate restriction endonuclease site(s) by procedures known in the art. 

Large numbers of suitable vectors are known to those of skill in the art, and are 
commercially available. Such vectors include, but are not limited to, the following 
vectors: 1) Bacterial -- pQE70, pQE60, pQE-9 (Qiagen), pBS, pDIO, phagescript, 

25 psiX174, pbluescript SK, pBSKS, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia); 2) Eukaryotic - 
pWLNEO, pSV2CAT, pOG44, PXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia); and 3) Baculovirus - pPbac and pMbac (Stratagene). Any other plasmid or 
vector may be used as long as they are replicable and viable in the host. In some 

30 preferred embodiments of the present invention, mammalian expression vectors comprise 
an origin of replication, a suitable promoter and enhancer, and also any necessary 
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ribosome binding sites, polyadenylation sites, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking non-transcribed sequences. In 
other embodiments, DNA sequences derived from the SV40 splice, and polyadenylation 
sites may be used to provide the required non-transcribed genetic elements. 

In certain embodiments of the present invention, the DNA sequence in the 
expression vector is operatively linked to an appropriate expression control sequence(s) 
(promoter) to direct mRNA synthesis. Promoters useful in the present invention include, 
but are not limited to, the LTR or SV40 promoter, the E. coli lac or trp, the phage lambda 
PL and Pr, T3 and T7 promoters, and the cytomegalovirus (CMV) immediate early, 
herpes simplex virus (HSV) thymidine kinase, and mouse metallothionein-I promoters 
and other promoters known to control expression of gene in prokaryotic or eukaryotic 
cells or their viruses. In other embodiments of the present invention, recombinant 
expression vectors include origins of replication and selectable markers permitting 
transformation of the host cell {e.g., dihydrofolate reductase or neomycin resistance for 
eukaryotic cell culture, or tetracycline or ampicillin resistance in E. coli). 

In some embodiments of the present invention, transcription of the DNA encoding 
the polypeptides of the present invention by higher eukaryotes is increased by inserting 
an enhancer sequence into the vector. Enhancers are c/s-acting elements of DNA, usually 
about from 10 to 300 bp that act on a promoter to increase its transcription. Enhancers 
useful in the present invention include, but are not limited to, the SV40 enhancer on the 
late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus 
enhancers. 

In other embodiments, the expression vector also contains a ribosome binding site 
for translation initiation and a transcription terminator. In still other embodiments of the 
present invention, the vector may also include appropriate sequences for amplifying 
expression. 

2. Host Cells for Production of Cayman ataxia 

In a further embodiment, the present invention provides host cells containing the 
above-described constructs. In some embodiments of the present invention, the host cell 
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is a higher eukaryotic cell (e.g., a mammalian or insect cell). In other embodiments of 
the present invention, the host cell is a lower eukaryotic cell (e.g., a yeast cell). In still 
other embodiments of the present invention, the host cell can be a prokaryotic cell (e.g., a 
bacterial cell). Specific examples of host cells include, but are not limited to, Escherichia 
5 coli, Salmonella typhimurium, Bacillus subtilis, and various species within the genera 
Pseudomonas, Streptomyces, and Staphylococcus, as well as Saccharomycees cerivisiae, 
Schizosaccharomycees pombe, Drosophila S2 cells, Spodoptera Sf9 cells, Chinese 
hamster ovary (CHO) cells, COS-7 lines of monkey kidney fibroblasts, (Gluzman, Cell 
23:175 [1981]), C127, 3T3, 293, 293T, HeLa and BHK cell lines. 

10 The constructs in host cells can be used in a conventional manner to produce the 

gene product encoded by the recombinant sequence. In some embodiments, introduction 
of the construct into the host cell can be accomplished by calcium phosphate transfection, 
DEAE-Dextran mediated transfection, or electroporation (See e.g., Davis et al, Basic 
Methods in Molecular Biology, [1986]). Alternatively, in some embodiments of the 

1 5 present invention, the polypeptides of the invention can be synthetically produced by 
conventional peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under 
the control of appropriate promoters. Cell-free translation systems can also be employed 
to produce such proteins using RNAs derived from the DNA constructs of the present 

20 invention. Appropriate cloning and expression vectors for use with prokaryotic and 
eukaryotic hosts are described by Sambrook, et al, Molecular Cloning: A Laboratory 
Manual, Second Edition, Cold Spring Harbor, N.Y., [1989]. 

In some embodiments of the present invention, following transformation of a 
suitable host strain and growth of the host strain to an appropriate cell density, the 

25 selected promoter is induced by appropriate means (e.g., temperature shift or chemical 
induction) and cells are cultured for an additional period. In other embodiments of the 
present invention, cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further purification. In still 
other embodiments of the present invention, microbial cells employed in expression of 

30 proteins can be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents. 
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3. Purification of Cayman ataxia 

The present invention also provides methods for recovering and purifying 
Cayman ataxia from recombinant cell cultures including, but not limited to, ammonium 
5 sulfate or ethanol precipitation, acid extraction, anion or cation exchange 
chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. In other embodiments of the present invention, protein-refolding steps 
can be used as necessary, in completing configuration of the mature protein. In still other 

10 embodiments of the present invention, high performance liquid chromatography (HPLC) 
can be employed for final purification steps. 

The present invention further provides polynucleotides having the coding 
sequence {e.g., SEQ ID NOs: 3, 1 1, 8, and 10) fused in frame to a marker sequence that 
allows for purification of the polypeptide of the present invention. A non-limiting 

1 5 example of a marker sequence is a hexahistidine tag which may be supplied by a vector, 
preferably a pQE-9 vector, which provides for purification of the polypeptide fused to the 
marker in the case of a bacterial host, or, for example, the marker sequence may be a 
hemagglutinin (HA) tag when a mammalian host (e.g., COS-7 cells) is used. The HA tag 
corresponds to an epitope derived from the influenza hemagglutinin protein (Wilson et 

20 al, Cell, 37:767 [1984]). 
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4. Truncation Mutants of Cayman ataxia 

In addition, the present invention provides fragments of Cayman ataxia (i.e., 
truncation mutants). In some embodiments of the present invention, when expression of 
a portion of the Cayman ataxia protein is desired, it may be necessary to add a start codon 
5 (ATG) to the oligonucleotide fragment containing the desired sequence to be expressed. 
It is well known in the art that a methionine at the N-terminal position can be 
enzymatically cleaved by the use of the enzyme methionine aminopeptidase (MAP). 
MAP has been cloned from E. coli (Ben-Bassat et al. 9 J. Bacteriol., 169:751 [1987]) and 
Salmonella typhimurium and its in vitro activity has been demonstrated on recombinant 
10 proteins (Miller et ai 9 Proc. Natl. Acad. Sci. USA 84:2718 [1990]). Therefore, removal 
of an N-terminal methionine, if desired, can be achieved either in vivo by expressing such 
recombinant polypeptides in a host which produces MAP (e.g., E. coli or CM89 or S. 
cerivisiae), or in vitro by use of purified MAP. 

15 5. Fusion Proteins Containing Cayman ataxia 

The present invention also provides fusion proteins incorporating all or part of 
Cayman ataxia. Accordingly, in some embodiments of the present invention, the coding 
sequences for the polypeptide can be incorporated as a part of a fusion gene including a 
nucleotide sequence encoding a different polypeptide. It is contemplated that this type of 

20 expression system will find use under conditions where it is desirable to produce an 

immunogenic fragment of a Cayman ataxia protein. In some embodiments of the present 
invention, the VP6 capsid protein of rotavirus is used as an immunologic carrier protein 
for portions of the Cayman ataxia polypeptide, either in the monomeric form or in the 
form of a viral particle. In other embodiments of the present invention, the nucleic acid 

25 sequences corresponding to the portion of Cayman ataxia against which antibodies are to 
be raised can be incorporated into a fusion gene construct which includes coding 
sequences for a late vaccinia virus structural protein to produce a set of recombinant 
viruses expressing fusion proteins comprising a portion of Cayman ataxia as part of the 
virion. It has been demonstrated with the use of immunogenic fusion proteins utilizing 

30 the hepatitis B surface antigen fusion proteins that recombinant hepatitis B virions can be 
utilized in this role as well. Similarly, in other embodiments of the present invention, 
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chimeric constructs coding for fusion proteins containing a portion of Cayman ataxia and 
the poliovirus capsid protein are created to enhance immunogenicity of the set of 
polypeptide antigens (See e.g., EP Publication No. 025949; and Evans et al, Nature 
339:385 [1989]; Huang et al, J. Virol, 62:3855 [1988]; and Schlienger et al, J. Virol, 
5 66:2 [1992]). 

In still other embodiments of the present invention, the multiple antigen peptide 
system for peptide-based immunization can be utilized. In this system, a desired portion 
of Cayman ataxia is obtained directly from organo-chemical synthesis of the peptide onto 
an oligomeric branching lysine core (see e.g., Posnett et al, J. Biol. Chem, 263:1719 

10 [1988]; and Nardelli et al, J. Immunol, 148:914 [1992]). In other embodiments of the 
present invention, antigenic determinants of the Cayman ataxia proteins can also be 
expressed and presented by bacterial cells. 

In addition to utilizing fusion proteins to enhance immunogenicity, it is widely 
appreciated that fusion proteins can also facilitate the expression of proteins, such as the 

1 5 Cayman ataxia protein of the present invention. Accordingly, in some embodiments of 
the present invention, Cayman ataxia can be generated as a glutathiorie-S-transferase (i.e., 
GST fusion protein). It is contemplated that such GST fusion proteins will enable easy 
purification of Cayman ataxia, such as by the use of glutathione-derivatized matrices (See 
e.g., Ausabel et al. (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, 

20 NY [ 1 99 1 ]). In another embodiment of the present invention, a fusion gene coding for a 
purification leader sequence, such as a poly-(His)/enterokinase cleavage site sequence at 
the N-terminus of the desired portion of Cayman ataxia, can allow purification of the 
expressed Cayman ataxia fusion protein by affinity chromatography using a Ni 2+ metal 
resin. In still another embodiment of the present invention, the purification leader 

25 sequence can then be subsequently removed by treatment with enterokinase (See e.g., 
Hochuli et al, J. Chromatogr., 41 1:177 [1987]; and Janknecht et al, Proc. Natl. Acad. 
Sci. USA 88:8972). 

Techniques for making fusion genes are well known. Essentially, the joining of 
various DNA fragments coding for different polypeptide sequences is performed in 
30 accordance with conventional techniques, employing blunt-ended or stagger-ended 
termini for ligation, restriction enzyme digestion to provide for appropriate termini, 
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filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid 
undesirable joining, and enzymatic ligation. In another embodiment of the present 
invention, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, in other embodiments of the present 
5 invention, PCR amplification of gene fragments can be carried out using anchor primers 
which give rise to complementary overhangs between two consecutive gene fragments 
which can subsequently be annealed to generate a chimeric gene sequence (See e.g., 
Current Protocols in Molecular Biology, supra). 

10 6. Variants of Cayman ataxia 

Still other embodiments of the present invention provide mutant or variant forms 
of Cayman ataxia (i.e., muteins). It is possible to modify the structure of a peptide having 
an activity of Cayman ataxia for such purposes as enhancing therapeutic or prophylactic 
efficacy, or stability (e.g., ex vivo shelf life, and/or resistance to proteolytic degradation in 

15 vivo). Such modified peptides are considered functional equivalents of peptides having 
an activity of the subject Cayman ataxia proteins as defined herein. A modified peptide 
can be produced in which the amino acid sequence has been altered, such as by amino 
acid substitution, deletion, or addition. 

Moreover, as described above, variant forms (e.g., mutants or polymorphic 

20 sequences) of the subject Cayman ataxia proteins are also contemplated as being 

equivalent to those peptides and DNA molecules that are set forth in more detail. For 
example, as described above, the present invention encompasses mutant and variant 
proteins that contain conservative or non-conservative amino acid substitutions. 

This invention further contemplates a method of generating sets of combinatorial 

25 mutants of the present Cayman ataxia proteins, as well as truncation mutants, and is 

especially useful for identifying potential variant sequences (i.e., mutants or polymorphic , 
sequences) that are involved in disease (e.g., ataxia or other movement diseases). The 
purpose of screening such combinatorial libraries is to generate, for example, novel 
Cayman ataxia variants that can act as either agonists or antagonists, or alternatively, 

30 possess novel activities all together. 
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Therefore, in some embodiments of the present invention, Cayman ataxia variants 
are engineered by the present method to provide altered (e.g., increased or decreased) 
biological activity. In other embodiments of the present invention, 
combinatorially-derived variants are generated which have a selective potency relative to 
5 a naturally occurring Cayman ataxia. Such proteins, when expressed from recombinant 
DNA constructs, can be used in gene therapy protocols. 

Still other embodiments of the present invention provide Cayman ataxia variants 
that have intracellular half-lives dramatically different than the corresponding wild-type 
protein. For example, the altered protein can be rendered either more stable or less stable 

10 to proteolytic degradation or other cellular process that result in destruction of, or 

otherwise inactivate Cayman ataxia. Such variants, and the genes which encode them, 
can be utilized to alter the location of Cayman ataxia expression by modulating the 
half-life of the protein. For instance, a short half-life can give rise to more transient 
Cayman ataxia biological effects and, when part of an inducible expression system, can 

15 allow tighter control of Cayman ataxia levels within the cell. As above, such proteins, 
and particularly their recombinant nucleic acid constructs, can be used in gene therapy 
protocols. 

In still other embodiments of the present invention, Cayman ataxia variants are 
generated by the combinatorial approach to act as antagonists, in that they are able to 

20 interfere with the ability of the corresponding wild-type protein to regulate cell function. 

In some embodiments of the combinatorial mutagenesis approach of the present 
invention, the amino acid sequences for a population of Cayman ataxia homologs, 
variants or other related proteins are aligned, preferably to promote the highest homology 
possible. Such a population of variants can include, for example, Cayman ataxia 

25 homologs from one or more species, or Cayman ataxia variants from the same species but 
which differ due to mutation or polymorphisms. Amino acids that appear at each 
position of the aligned sequences are selected to create a degenerate set of combinatorial 
sequences. 

In a preferred embodiment of the present invention, the combinatorial Cayman 
30 ataxia library is produced by way of a degenerate library of genes encoding a library of 
polypeptides which each include at least a portion of potential Cayman ataxia protein 
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sequences. For example, a mixture of synthetic oligonucleotides can be enzymatically 
ligated into gene sequences such that the degenerate set of potential Cayman ataxia 
sequences are expressible as individual polypeptides, or alternatively, as a set of larger 
fusion proteins (e.g., for phage display) containing the set of Cayman ataxia sequences 
5 therein. 

There are many ways by which the library of potential Cayman ataxia homologs 
and variants can be generated from a degenerate oligonucleotide sequence. In some 
embodiments, chemical synthesis of a degenerate gene sequence is carried out in an 
automatic DNA synthesizer, and the synthetic genes are ligated into an appropriate gene 

10 for expression. The purpose of a degenerate set of genes is to provide, in one mixture, all 
of the sequences encoding the desired set of potential Cayman ataxia sequences. The 
synthesis of degenerate oligonucleotides is well known in the art (See e.g., Narang, 
Tetrahedron Lett., 39:39 [1983]; Itakura et al. 9 Recombinant DNA, in Walton (ed.), 
Proceedings of the 3rd Cleveland Symposium on Macromolecules, Elsevier, Amsterdam, 

15 pp 273-289 [1981]; Itakura et aL, Annu. Rev. Biochem., 53:323 [1984]; Itakura et aL, 
Science 198:1056 [1984]; Ike et aL, Nucl. Acid Res., 1 1:477 [1983]). Such techniques 
have been employed in the directed evolution of other proteins (See e.g., Scott et aL, 
Science 249:386 [1980]; Roberts etaL, Proc. Natl. Acad. Sci. USA 89:2429 [1992]; 
Devlin et aL, Science 249: 404 [1990]; Cwirla et aL, Proc. Natl. Acad. Sci. USA 87: 6378 

20 [1990]; each of which is herein incorporated by reference; as well as U.S. Pat. Nos. 

5,223,409, 5,198,346, and 5,096,815; each of which is incorporated herein by reference). 

It is contemplated that the Cayman ataxia nucleic acids (e.g., SEQ ID NO:3, and 
fragments and variants thereof) can be utilized as starting nucleic acids for directed 
evolution. These techniques can be utilized to develop Cayman ataxia variants having 

25 desirable properties such as increased or decreased biological activity. 

In some embodiments, artificial evolution is performed by random mutagenesis 
(e.g., by utilizing error-prone PCR to introduce random mutations into a given coding 
sequence). This method requires that the frequency of mutation be finely tuned. As a 
general rule, beneficial mutations are rare, while deleterious mutations are common. This 

30 is because the combination of a deleterious mutation and a beneficial mutation often 

results in an inactive enzyme. The ideal number of base substitutions for targeted gene is 
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usually between 1.5 and 5 (Moore and Arnold, Nat. Biotech., 14, 458 [1996]; Leung et 
a/., Technique, 1:11 [1989]; Eckert and Kunkel, PCR Methods Appl., 1:17-24 [1991]; 
Caldwell and Joyce, PCR Methods Appl., 2:28 [1992]; and Zhao and Arnold, Nuc. Acids. 
Res., 25: 1307 [1997]). After mutagenesis, the resulting clones are selected for desirable 
5 activity (e.g., screened for Cayman ataxia activity). Successive rounds of mutagenesis 
and selection are often necessary to develop enzymes with desirable properties. It should 
be noted that only the useful mutations are carried over to the next round of mutagenesis. 

In other embodiments of the present invention, the polynucleotides of the present 
invention are used in gene shuffling or sexual PCR procedures (e.g., Smith, Nature, 

10 370:324 [1994]; U.S. Pat. Nos. 5,837,458; 5,830,721; 5,811,238; 5,733,731; all of which 
are herein incorporated by reference). Gene shuffling involves random fragmentation of 
several mutant DNAs followed by their reassembly by PCR into full length molecules. 
Examples of various gene shuffling procedures include, but are not limited to, assembly 
following DNase treatment, the staggered extension process (STEP), and random priming 

15 in vitro recombination. In the DNase mediated method, DNA segments isolated from a 
pool of positive mutants are cleaved into random fragments with DNasel and subjected to 
multiple rounds of PCR with no added primer. The lengths of random fragments 
approach that of the uncleaved segment as the PCR cycles proceed, resulting in mutations 
in present in different clones becoming mixed and accumulating in some of the resulting 

20 sequences. Multiple cycles of selection and shuffling have led to the functional 

enhancement of several enzymes (Stemmer, Nature, 370:398 [1994]; Stemmer, Proc. 
Natl. Acad. Sci. USA, 91:10747 [1994]; Crameri et ai 9 Nat. Biotech., 14:315 [1996]; 
Zhang et al., Proc. Natl. Acad. Sci. USA, 94:4504 [1997]; and Crameri et aL 9 Nat. 
Biotech., 15:436 [1997]). Variants produced by directed evolution can be screened for 

25 Cayman ataxia activity by the methods described herein. 

A wide range of techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations, and for screening cDNA libraries for 
gene products having a certain property. Such techniques will be generally adaptable for- 
rapid screening of the gene libraries generated by the combinatorial mutagenesis or 

30 recombination of Cayman ataxia homologs or variants. The most widely used techniques 
for screening large gene libraries typically comprises cloning the gene library into 
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replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates relatively easy isolation of the vector encoding the gene whose 
.product was detected. 

5 

7. Chemical Synthesis of Cayman ataxia 

In an alternate embodiment of the invention, the coding sequence of Cayman 
ataxia is synthesized, whole or in part, using chemical methods well known in the art (See 
e.g., Caruthers et aL, Nucl. Acids Res. Symp. Ser., 7:215 [1980]; Crea and Horn, Nucl. 

10 Acids Res., 9:2331 [1980]; Matteucci and Caruthers, Tetrahedron Lett., 21:719 [1980]; 
and Chow and Kempe, Nucl. Acids Res., 9:2807 [1981]). In other embodiments of the 
present invention, the protein itself is produced using chemical methods to synthesize 
either an entire Cayman ataxia amino acid sequence or a portion thereof. For example, 
peptides can be synthesized by solid phase techniques, cleaved from the resin, and 

1 5 purified by preparative high performance liquid chromatography (See e.g. , Creighton, 
Proteins Structures And Molecular Principles, W H Freeman and Co, New York N. Y. 
[1983]). In other embodiments of the present invention, the composition of the synthetic 
peptides is confirmed by amino acid analysis or sequencing (See e.g., Creighton, supra). 
Direct peptide synthesis can be performed using various solid-phase techniques 

20 (Roberge et aL, Science 269:202 [1995]) and automated synthesis may be achieved, for 
example, using ABI 431 A Peptide Synthesizer (Perkin Elmer) in accordance with the 
instructions provided by the manufacturer. Additionally, the amino acid sequence of 
Cayman ataxia, or any part thereof, may be altered during direct synthesis and/or 
combined using chemical methods with other sequences to produce a variant polypeptide. 

25 

III, Detection of Cayman ataxia Alleles 

In some embodiments, the present invention provides methods of detecting the 
presence of wild or variant (e.g., mutant or polymorphic) Cayman ataxia nucleic acids or 
polypeptides. The detection of mutant Cayman ataxia polypeptides finds use in the 
30 diagnosis of disease (e.g. , ataxia and other movement diseases). 
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A. Cayman ataxia Alleles 

In some embodiments, the present invention includes alleles of Cayman ataxia 
that increase a patient's susceptibility to Caymans ataxia and other movement diseases 
such as ataxia, dystonia or myoclonus (e.g., including, variants of SEQ ED NO:3 and 1 1 
5 such as SEQ ID NOs8 and 10; See e.g., the illustrative Examples below). However, the 
present invention is not limited to the mutations described herein. Any mutation that 
results in the undesired phenotype (e.g., ataxia) is within the scope of the present 
invention. 

10 B. Detection of Cayman ataxia Alleles 

Accordingly, the present invention provides methods for determining whether a 
patient has an increased susceptibility to Caymans ataxia or other movement disorders by 
determining whether the individual has a variant Cayman ataxia allele. In other 
embodiments, the present invention provides methods for providing a prognosis of 
15 increased risk for ataxia to an individual based on the presence or absence of one or more 
variant alleles of Cayman ataxia. In some embodiments, diagnosis is prenatal diagnosis, 
allowing a couple to assess their risk of having a child with a mutant Cayman ataxia 
allele. 

In other embodiments, the present invention provides methods of determining an 
20 individuals Cayman ataxia carrier status. The present invention is not limited to a 

particular mechanism. Indeed, an understanding of the mechanism is not necessary to 
practice the present invention. Nonetheless, it is contemplated that Cayman ataxia is 
often a recessive disorder. Thus, two mutant alleles are necessary for disease states. 
Accordingly, in some embodiments, couples contemplating starting a family are screened 
25 to determine both parents 1 carrier status. This allows for in utero testing (e.g., 
amniocentesis) only in cases where both parents are carriers. 

A number of methods are available for analysis of variant (e.g., mutant or 
polymorphic) nucleic acid sequences. Assays for detection variants (e.g., polymorphisms 
or mutations) fall into several categories, including, but not limited to direct sequencing 
30 assays, fragment polymorphism assays, hybridization assays, and computer based data 
analysis. Protocols and commercially available kits or services for performing multiple 
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variations of these assays are available. In some embodiments, assays are performed in 
combination or in hybrid (e.g., different reagents or technologies from several assays are 
combined to yield one assay). The following assays are useful in the present invention. 

5 1. Direct sequencing Assays 

In some embodiments of the present invention, variant sequences are detected 
using a direct sequencing technique. In these assays, DNA samples are first isolated from 
a subject using any suitable method. In some embodiments, the region of interest is 
cloned into a suitable vector and amplified by growth in a host cell (e.g., a bacteria). In 
10 other embodiments, DNA in the region of interest is amplified using PCR. 

Following amplification, DNA in the region of interest (e.g., the region containing 
the SNP or mutation of interest) is sequenced using any suitable method, including but 
not limited to manual sequencing using radioactive marker nucleotides, or automated 
sequencing. The results of the sequencing are displayed using any suitable method. The 
15 sequence is examined and the presence or absence of a given SNP or mutation is 
determined. 

2. PCR Assay 

In some embodiments of the present invention, variant sequences are detected 
20 using a PCR-based assay. In some embodiments, the PCR assay comprises the use of 
oligonucleotide primers that hybridize only to the variant or wild type allele of Cayman 
ataxia (e.g., to the region of polymorphism or mutation). Both sets of primers are used to 
amplify a sample of DNA. If only the mutant primers result in a PCR product, then the 
patient has the mutant Cayman ataxia allele. If only the wild-type primers result in a 
25 PCR product, then the patient has the wild type allele of Cayman ataxia. 

3. Mutational detection by dHPLC 

In some embodiments of the present invention, variant sequences are detected 
using a PCR-based assay with consecutive detection of nucleotide variants by dHPLC 
30 (denaturing high performance liquid chromatography). Exemplary systems and 
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Methods for dHPLC include, but are not limited to, WAVE (Transgenomic, Inc; Omaha, 
NE) or VARIAN equipment (Palo Alto, CA). 

4. Fragment Length Polymorphism Assays 
5 In some embodiments of the present invention, variant sequences are detected 

using a fragment length polymorphism assay. In a fragment length polymorphism assay, 
a unique DNA banding pattern based on cleaving the DNA at a series of positions is 
generated using an enzyme (e.g., a restriction enzyme or a CLEAVASE I [Third Wave 
Technologies, Madison, WI] enzyme). DNA fragments from a sample containing a SNP 
10 or a mutation will have a different banding pattern than wild type. 

a, RFLP Assay 

In some embodiments of the present invention, variant sequences are detected 
using a restriction fragment length polymorphism assay (RFLP). The region of interest is 
1 5 first isolated using PCR. The PCR products are then cleaved with restriction enzymes 
known to give a unique length fragment for a given polymorphism. The restriction- 
enzyme digested PCR products are separated by agarose gel electrophoresis and 
visualized by ethidium bromide staining. The length of the fragments is compared to 
molecular weight markers and fragments generated from wild-type and mutant controls. 

20 

b. CFLP Assay 

In other embodiments, variant sequences are detected using a CLEAVASE 
fragment length polymorphism assay (CFLP; Third Wave Technologies, Madison, WI; 
See e.g., U.S. Patent Nos. 5,843,654; 5,843,669; 5,719,208; and 5,888,780; each of which 

25 is herein incorporated by reference). This assay is based on the observation that when 
single strands of DNA fold on themselves, they assume higher order structures that are 
highly individual to the precise sequence of the DNA molecule. These secondary 
structures involve partially duplexed regions of DNA such that single stranded regions 
are juxtaposed with double stranded DNA hairpins. The CLEAVASE I enzyme, is a 

30 structure-specific, thermostable nuclease that recognizes and cleaves the junctions 
between these single-stranded and double-stranded regions. 
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The region of interest is first isolated, for example, using PCR. Then, DNA 
strands are separated by heating. Next, the reactions are cooled to allow intrastrand 
secondary structure to form. The PCR products are then treated with the CLEAVASE I 
enzyme to generate a series of fragments that are unique to a given SNP or mutation. The 
5 CLEAVASE enzyme treated PCR products are separated and detected (e.g., by agarose 
gel electrophoresis) and visualized (e.g., by ethidium bromide staining). The length of 
the fragments is compared to molecular weight markers and fragments generated from 
wild-type and mutant controls. 

10 5. Hybridization Assays 

In preferred embodiments of the present invention, variant sequences are detected 
a hybridization assay. In a hybridization assay, the presence of absence of a given SNP 
or mutation is determined based on the ability of the DNA from the sample to hybridize 
to a complementary DNA molecule (e.g. , a oligonucleotide probe). A variety of 

1 5 hybridization assays using a variety of technologies for hybridization and detection are 
available. A description of a selection of assays is provided below. 

a. Direct Detection of Hybridization 

In some embodiments, hybridization of a probe to the sequence of interest (e.g., a 
20 SNP or mutation) is detected directly by visualizing a bound probe (e.g., a Northern or 
Southern assay; See e.g., Ausabel et al (eds.), Current Protocols in Molecular Biology, 
John Wiley & Sons, NY [1991]). In a these assays, genomic DNA (Southern) or RNA 
(Northern) is isolated from a subject. The DNA or RNA is then cleaved with a series of 
restriction enzymes that cleave infrequently in the genome and not near any of the 
25 markers being assayed. The DNA or RNA is then separated (e.g., on an agarose gel) and 
transferred to a membrane. A labeled (e.g., by incorporating a radionucleotide) probe or 
probes specific for the SNP or mutation being detected is allowed to contact the 
membrane under a condition or low, medium, or high stringency conditions. Unbound 
probe is removed and the presence of binding is detected by visualizing the labeled 
30 probe. 
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b. Detection of Hybridization Using "DNA Chip" Assays 
In some embodiments of the present invention, variant sequences are detected 
using a DNA chip hybridization assay. In this assay, a series of oligonucleotide probes 
are affixed to a solid support. The oligonucleotide probes are designed to be unique to a 
5 given SNP or mutation. The DNA sample of interest is contacted with the DNA "chip" 
and hybridization is detected. 

In some embodiments, the DNA chip assay is a GeneChip (Affymetrix, Santa 
Clara, CA; See e.g., U.S. Patent Nos. 6,045,996; 5,925,525; and 5,858,659; each of which 
is herein incorporated by reference) assay. The GeneChip technology uses miniaturized, 
10 high-density arrays of oligonucleotide probes affixed to a "chip." Probe arrays are 

manufactured by Affymetrix's light-directed chemical synthesis process, which combines 
solid-phase chemical synthesis with photolithographic fabrication techniques employed 
in the semiconductor industry. Using a series of photolithographic masks to define chip 
exposure sites, followed by specific chemical synthesis steps, the process constructs 
15 high-density arrays of oligonucleotides, with each probe in a predefined position in the 
array. Multiple probe arrays are synthesized simultaneously on a large glass wafer. The 
wafers are then diced, and individual probe arrays are packaged in injection-molded 
plastic cartridges, which protect them from the environment and serve as chambers for 
hybridization. 

20 The nucleic acid to be analyzed is isolated, amplified by PCR, and labeled with a 

fluorescent reporter group. The labeled DNA is then incubated with the array using a 
fluidics station. The array is then inserted into the scanner, where patterns of 
hybridization are detected. The hybridization data are collected as light emitted from the 
fluorescent reporter groups already incorporated into the target, which is bound to the 

25 probe array. Probes that perfectly match the target generally produce stronger signals 
than those that have mismatches. Since the sequence and position of each probe on the 
array are known, by complementarity, the identity of the target nucleic acid applied to the 
probe array can be determined. 

In other embodiments, a DNA microchip containing electronically captured 

30 probes (Nanogen, San Diego, CA) is utilized (See e.g., U.S. Patent Nos. 6,017,696; 

6,068,818; and 6,051,380; each of which are herein incorporated by reference). Through 
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the use of microelectronics; Nanogen's technology enables the active movement and 
concentration of charged molecules to and from designated test sites on its semiconductor 
microchip. DNA capture probes unique to a given SNP or mutation are electronically 
placed at, or "addressed" to, specific sites on the microchip. Since DNA has a strong 
5 negative charge, it can be electronically moved to an area of positive charge. 

First, a test site or a row of test sites on the microchip is electronically activated 
with a positive charge. Next, a solution containing the DNA probes is introduced onto 
the microchip. The negatively charged probes rapidly move to the positively charged 
sites, where they concentrate and are chemically bound to a site on the microchip. The 

10 microchip is then washed and another solution of distinct DNA probes is added until the 
array of specifically bound DNA probes is complete. 

A test sample is then analyzed for the presence of target DNA molecules by 
determining which of the DNA capture probes hybridize, with complementary DNA in 
the test sample (e.g., a PCR amplified gene of interest). An electronic charge is also used 

15 to move and concentrate target molecules to one or more test sites on the microchip. The 
electronic concentration of sample DNA at each test site promotes rapid hybridization of 
sample DNA with complementary capture probes (hybridization may occur in minutes). 
To remove any unbound or nonspecifically bound DNA from each site, the polarity or 
charge of the site is reversed to negative, thereby forcing any unbound or nonspecifically 

20 bound DNA back into solution away from the capture probes. A laser-based fluorescence 
scanner is used to detect binding, 

In still further embodiments, an array technology based upon the segregation of 
fluids on a flat surface (chip) by differences in surface tension (ProtoGene, Palo Alto, 
CA) is utilized (See e.g., U.S. Patent Nos. 6,001,31 1; 5,985,551; and 5,474,796; each of 

25 which is herein incorporated by reference). Protogene's technology is based on the fact 
that fluids can be segregated on a flat surface by differences in surface tension that have 
been imparted by chemical coatings. Once so segregated, oligonucleotide probes are 
synthesized directly on the chip by ink-jet printing of reagents. The array with its 
reaction sites defined by surface tension is mounted on a X/Y translation stage under a set 

30 of four piezoelectric nozzles, one for each of the four standard DNA bases. The 

translation stage moves along each of the rows of the array and the appropriate reagent is 
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delivered to each of the reaction site. For example, the A amidite is delivered only to the 
sites where amidite A is to be coupled during that synthesis step and so on. Common 
reagents and washes are delivered by flooding the entire surface and then removing them 
by spinning. 

5 DNA probes unique for the SNP or mutation of interest are affixed to the chip 

using Protogene's technology. The chip is then contacted with the PCR-amplified genes 
of interest. Following hybridization, unbound DNA is removed and hybridization is 
detected using any suitable method (e.g., by fluorescence de-quenching of an 
incorporated fluorescent group). 

10 In yet other embodiments, a "bead array" is used for the detection of 

polymorphisms (Illumina, San Diego, CA; See e.g., PCT Publications WO 99/67641 and 
WO 00/39587, each of which is herein incorporated by reference). Illumina uses a 
BEAD ARRAY technology that combines fiber optic bundles and beads that 
self-assemble into an array. Each fiber optic bundle contains thousands to millions of 

15 individual fibers depending on the diameter of the bundle. The beads are coated with an 
oligonucleotide specific for the detection of a given SNP or mutation. Batches of beads 
are combined to form a pool specific to the array. To perform an assay, the BEAD 
ARRAY is contacted with a prepared subject sample (e.g., DNA). Hybridization is 
detected using any suitable method. 

20 

c. Enzymatic Detection of Hybridization 

In some embodiments of the present invention, hybridization is detected by 
enzymatic cleavage of specific structures (INVADER assay, Third Wave Technologies; 
See e.g., U.S. Patent Nos. 5,846,717, 6,090,543; 6,001,567; 5,985,557; and 5,994,069; 

25 each of which is herein incorporated by reference). The INVADER assay detects specific 
DNA and RNA sequences by using structure-specific enzymes to cleave a complex 
formed by the hybridization of overlapping oligonucleotide probes. Elevated temperature 
and an excess of one of the probes enable multiple probes to be cleaved for each target 
sequence present without temperature cycling. These cleaved probes then direct cleavage 

30 of a second labeled probe. The secondary probe oligonucleotide can be 5 '-end labeled 
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with fluorescein that is quenched by an internal dye. Upon cleavage, the de-quenched 
fluorescein labeled product may be detected using a standard fluorescence plate reader. 

The INVADER assay detects specific mutations and SNPs in unamplified 
genomic DNA. The isolated DNA sample is contacted with the first probe specific either 
5 for a SNP/mutation or wild type sequence and allowed to hybridize. Then a secondary 
probe, specific to the first probe, and containing the fluorescein label, is hybridized and 
the enzyme is added. Binding is detected by using a fluorescent plate reader and 
comparing the signal of the test sample to known positive and negative controls. 

In some embodiments, hybridization of a bound probe is detected using a TaqMan 

10 assay (PE Biosystems, Foster City, CA; See e.g., U.S. Patent Nos. 5,962,233 and 

5,538,848, each of which is herein incorporated by reference). The assay is performed 
during a PCR reaction. The TaqMan assay exploits the 5' -3' exonuclease activity of the 
AMPLITAQ GOLD DNA polymerase. A probe, specific for a given allele or mutation, 
is included in the PCR reaction. The probe consists of an oligonucleotide with a 5'- 

1 5 reporter dye (e.g. , a fluorescent dye) and a 3 '-quencher dye. During PCR, if the probe is 
bound to its target, the 5-3' nucleolytic activity of the AMPLITAQ GOLD polymerase 
cleaves the probe between the reporter and the quencher dye. The separation of the 
reporter dye from the quencher dye results in an increase of fluorescence. The signal 
accumulates with each cycle of PCR and can be monitored with a fluorimeter. 

20 In still further embodiments, polymorphisms are detected using the SNP-IT 

primer extension assay (Orchid Biosciences, Princeton, NJ; See e.g., U.S. Patent Nos. 
5,952,174 and 5,919,626, each of which is herein incorporated by reference). In this 
assay, SNPs are identified by using a specially synthesized DNA primer and a DNA 
polymerase to selectively extend the DNA chain by one base at the suspected SNP 

25 location. DNA in.the region of interest is amplified and denatured. Polymerase reactions 
are then performed using miniaturized systems called microfluidics. Detection is 
accomplished by adding a label to the nucleotide suspected of being at the SNP or 
mutation location. Incorporation of the label into the DNA can be detected by any 
suitable method (e.g., if the nucleotide contains a biotin label, detection is via a 

30 fluorescently labeled antibody specific for biotin). 
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6. Mass Spectroscopy Assay 

In some embodiments, a MassARRAY system (Sequenom, San Diego, CA.) is 
used to detect variant sequences (See e.g., U.S. Patent Nos. 6,043,031; 5,777,324; and 
5,605,798; each of which is herein incorporated by reference). DNA is isolated from 
5 blood samples using standard procedures. Next, specific DNA regions containing the 
. mutation or SNP of interest, about 200 base pairs in length, are amplified by PCR. The 
amplified fragments are then attached by one strand to a solid surface and the 
non-immobilized strands are removed by standard denaturation and washing. The 
remaining immobilized single strand then serves as a template for automated enzymatic 

10 reactions that produce genotype specific diagnostic products. 

Very small quantities of the enzymatic products, typically five to ten nanoliters, 
are then transferred to a SpectroCHIP array for subsequent automated analysis with the 
SpectroREADER mass spectrometer. Each spot is preloaded with light absorbing 
crystals that form a matrix with the dispensed diagnostic product. The MassARRAY 

15 system uses MALDI-TOF (Matrix Assisted Laser Desorption Ionization - Time of Flight) 
mass spectrometry. In a process known as desorption, the matrix is hit with a pulse from 
a laser beam. Energy from the laser beam is transferred to the matrix and it is vaporized 
resulting in a small amount of the diagnostic product being expelled into a flight tube. As 
the diagnostic product is charged when an electrical field pulse is subsequently applied to 

20 the tube they are launched down the flight tube towards a detector. The time between 
application of the electrical field pulse and collision of the diagnostic product with the 
detector is referred to as the time of flight. This is a very precise measure of the produces 
molecular weight, as a molecule's mass correlates directly with time of flight with smaller 
molecules flying faster than larger molecules. The entire assay is completed in less than 

25 one thousandth of a second, enabling samples to be analyzed in a total of 3-5 second 
including repetitive data collection. The SpectroTYPER software then calculates, 
records, compares and reports the genotypes at the rate of three seconds per sample. 

7. Detection of Variant Cayman ataxia Proteins 
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In other embodiments, variant Cayman ataxia polypeptides are detected. Any 
suitable method may be used to detect truncated or mutant Cayman ataxia polypeptides 
including, but not limited to, those described below. 

a) Cell Free Translation 

For example, in some embodiments, cell-free translation methods from 
Ambergen, Inc. (Boston, MA) are utilized. Ambergen, Inc. has developed a method for 
the labeling, detection, quantitation, analysis and isolation of nascent proteins produced 
in a cell-free or cellular translation system without the use of radioactive amino acids or 
other radioactive labels. Markers are aminoacylated to tRNA molecules. Potential 
markers include native amino acids, non-native amino acids, amino acid analogs or 
derivatives, or chemical moieties. These markers are introduced into nascent proteins 
from the resulting misaminoacylated tRNAs during the translation process. 

One application of Ambergen's protein labeling technology is the gel free 
truncation test (GFTT) assay (See e.g., U.S. Patent 6,303,337, herein incorporated by 
reference). In some embodiments, this assay is used to screen for truncation mutations in 
a Cayman ataxia protein. In the GFTT assay, a marker (e.g., a fluorophore) is introduced 
to the nascent protein during translation near the N-terminus of the protein. A second and 
different marker (e.g., a fluorophore with a different emission wavelength) is introduced 
to the nascent protein near the C-terminus of the protein. The protein is then separated 
from the translation system and the signal from the markers is measured. A comparison 
of the measurements from the N and C terminal signals provides information on the 
fraction of the molecules with C-terminal truncation {i.e., if the normalized signal from 
the C-terminal marker is 50% of the signal from the N-terminal marker, 50% of the 
molecules have a C-terminal truncation). 

b) Antibody Binding 

In still further embodiments of the present invention, antibodies (See below for 
antibody production) are used to determine if an individual contains an allele encoding a 
variant Cayman ataxia gene. In preferred embodiments, antibodies are utilized that 
discriminate between variant {i.e., truncated proteins); and wild-type proteins (SEQ ID 
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N0s:4). In some particularly preferred embodiments, the antibodies are directed to the 
C-terminus of Cayman ataxia. Proteins that are recognized by the N-terminal, but not the 
C-terminal antibody are truncated. In some embodiments, quantitative immunoassays are 
used to determine the ratios of C-terminal to N-terminal antibody binding. In other 
5 embodiments, antibodies that differentially bind to wild type or variant forms of Cayman 
ataxia. 

Antibody binding is detected by techniques known in the art (e.g., 
radioimmunoassay, ELISA (enzyme-linked immunosorbant assay), "sandwich" 
immunoassays, immunoradiometric assays, gel diffusion precipitation reactions, 
10 immunodiffusion assays, in situ immunoassays (e.g., using colloidal gold, enzyme or 
radioisotope labels, for example), Western blots, precipitation reactions, agglutination 
assays (e.g., gel agglutination assays, hemagglutination assays, etc.), complement fixation 
assays, immunofluorescence assays, protein A assays, and immunoelectrophoresis assays, 
etc. 

15 In one embodiment, antibody binding is detected by detecting a label on the 

primary antibody. In another embodiment, the primary antibody is detected by detecting 
binding of a secondary antibody or reagent to the primary antibody. In a further 
embodiment, the secondary antibody is labeled. Many methods are known in the art for 
detecting binding in an immunoassay and are within the scope of the present invention. 

20 In some embodiments, an automated detection assay is utilized. Methods for the 

automation of immunoassays include those described in U.S. Patents 5,885,530, 
4,981,785, 6,159,750, and 5,358,691, each of which is herein incorporated by reference. 
In some embodiments, the analysis and presentation of results is also automated. For 
example, in some embodiments, software that generates a prognosis based on the result 

25 of the immunoassay is utilized. 

In other embodiments, the immunoassay described in U.S. Patents 5,599,677 and 
5,672,480; each of which is herein incorporated by reference. 

8. Kits for Analyzing Risk of Ataxia 

30 The present invention also provides kits for determining whether an individual 

contains a wild-type or variant (e.g., mutant or polymorphic) allele of Cayman ataxia. In 
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some embodiments, the kits are useful determining whether the subject is at risk of 
developing Cayman ataxia or other movement disease. The diagnostic kits are produced 
in a variety of ways. In some embodiments, the kits contain at least one reagent for 
specifically detecting a mutant Cayman ataxia allele or protein. In some embodiments, 
5 the reagent is a nucleic acid that hybridizes to nucleic acids containing the mutation and 
that does not bind to nucleic acids that do not contain the mutation. In other preferred 
embodiments, the reagents are primers for amplifying the region of DNA containing the 
mutation. In still other embodiments, the reagents are antibodies that preferentially bind 
either the wild-type or mutant Cayman ataxia proteins. 

10 In some embodiments, the kit contains instructions for determining whether the 

subject is at risk for developing ataxia or other movement diseases. In preferred 
embodiments, the instructions specify that risk for developing ataxia is determined by 
detecting the presence or absence of a mutant Cayman ataxia allele in the subject, 
wherein subjects having an mutant allele are at greater risk for ataxia or for having 

1 5 offspring with Cayman ataxia (in the case of a parent who is a carrier). 

The presence of absence of a disease-associated mutation in a Cayman ataxia 
gene can be used to may therapeutic or other medical decisions. For example, couples 
with a family history of hereditary ataxia may choose to conceive a child via in vitro 
fertilization and pre-implantation genetic screening. In this case, fertilized embryos are 

20 screened for mutant {e.g., disease associated) alleles of the Cayman ataxia gene and only 
embryos with wild type alleles are implanted in the uterus. 

In other embodiments, in utero screening is performed on a developing fetus {e.g., 
amniocentesis or chorionic villi screening). In still other embodiments, genetic screening 
of newborn babies or very young children is performed. The early detection of a Cayman 

25 ataxia allele known to be associated with disease allows for early intervention {e.g., 
genetic or pharmaceutical therapies). 

In some embodiments, the kits include ancillary reagents such as buffering agents, 
nucleic acid stabilizing reagents, protein stabilizing reagents, and signal producing 
systems {e.g., florescence generating systems as Fret systems). The test kit may be 

30 packages in any suitable manner, typically with the elements in a single container or 
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various containers as necessary along with a sheet of instructions for carrying out the test. 
In some embodiments, the kits also preferably include a positive control sample. 

9. Bioinformatics 

5 In some embodiments, the present invention provides methods of determining an 

individual's risk of developing ataxia or related movement disorders based on the 
presence of one or more variant alleles of Cayman ataxia. In some embodiments, the 
analysis of variant data is processed by a computer using information stored on a 
computer (e.g., in a database). For example, in some embodiments, the present invention 

10 provides a bioinformatics research system comprising a plurality of computers running a 
multi-platform object oriented programming language (See e.g., U.S. Patent 6,125,383; 
herein incorporated by reference). In some embodiments, one of the computers stores 
genetics data (e.g., the risk of contacting ataxia or related disorder associated with a given 
polymorphism, as well as the sequences). In some embodiments, one of the computers 

15 stores application programs (e.g., for analyzing the results of detection assays). Results 
are then delivered to the user (e.g., via one of the computers or via the internet. 

For example, in some embodiments, a computer-based analysis program is used 
to translate the raw data generated by the detection assay (e.g., the presence, absence, or 
amount of a given Cayman ataxia allele or polypeptide) into data of predictive value for a 

20 clinician. The clinician can access the predictive data using any suitable means. Thus, in 
some preferred embodiments, the present invention provides the further benefit that the 
clinician, who is not likely to be trained in genetics or molecular biology, need not 
understand the raw data. The data is presented directly to the clinician in its most useful 
form. The clinician is then able to immediately utilize the information in order to 

25 optimize the care of the subject. 

The present invention contemplates any method capable of receiving, processing, 
and transmitting the information to and from laboratories conducting the assays, 
information provides, medical personal, and subjects. For example, in some 
embodiments of the present invention, a sample (e.g., a biopsy or a serum or urine 

30 sample) is obtained from a subject and submitted to a profiling service (e.g., clinical lab 
at a medical facility, genomic profiling business, etc.), located in any part of the world 
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(e.g., in a country different than the country where the subject resides or where the 
information is ultimately used) to generate raw data. Where the sample comprises a 
tissue or other biological sample, the subject may visit a medical center to have the 
sample obtained and sent to the profiling center, or subjects may collect the sample 
5 themselves (e.g., a urine sample) and directly send it to a profiling center. Where the 
sample comprises previously determined biological information, the information may be 
directly sent to the profiling service by the subject (e.g., an information card containing 
the information may be scanned by a computer and the data transmitted to a computer of 
the profiling center using an electronic communication systems). Once received by the 

10 profiling service, the sample is processed and a profile is produced (i.e., presence of wild 
type or mutant Cayman ataxia genes or polypeptides), specific for the diagnostic or 
prognostic information desired for the subject. 

The profile data is then prepared in a format suitable for interpretation by a 
treating clinician. For example, rather than providing raw data, the prepared format may 

15 represent a diagnosis or risk assessment (e.g., likelihood of developing ataxia or a 

diagnosis of ataxia) for the subject, along with recommendations for particular treatment 
options. The data may be displayed to the clinician by any suitable method. For 
example, in some embodiments, the profiling service generates a report that can be 
printed for the clinician (e.g., at the point of care) or displayed to the clinician on a 

20 computer monitor. 

In some embodiments, the information is first analyzed at the point of care or at a 
regional facility. The raw data is then sent to a central processing facility for further 
analysis and/or to convert the raw data to information useful for a clinician or patient. 
The central processing facility provides the advantage of privacy (all data is stored in a 

25 central facility with uniform security protocols), speed, and uniformity of data analysis. 
The central processing facility can then control the fate of the data following treatment of 
the subject. For example, using an electronic communication system, the central facility 
can provide data to the clinician, the subject, or researchers. 

In some embodiments, the subject is able to directly access the data using the 

30 electronic communication system. The subject may chose further intervention or 

counseling based on the results. In some embodiments, the data is used for research use. 
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For example, the data may be used to further optimize the inclusion or elimination of 
markers as useful indicators of a particular condition or stage of disease. 

IV. Generation of Cayman ataxia Antibodies 

5 The present invention provides isolated antibodies or antibody fragments (e.g., 

FAB fragments). Antibodies can be generated to allow for the detection of Cayman 
ataxia protein. The antibodies may be prepared using various immunogens. In one 
embodiment, the immunogen is a human Cayman ataxia peptide to generate antibodies 
that recognize human Cayman ataxia. Such antibodies include, but are not limited to 

10 polyclonal, monoclonal, chimeric, single chain, Fab fragments, Fab expression libraries, 
or recombinant (e.g., chimeric, humanized, etc.) antibodies, as long as it can recognize 
the protein. Antibodies can be produced by using a protein of the present invention as the 
antigen according to a conventional antibody or antiserum preparation process. 

Various procedures known in the art may be used for the production of polyclonal 

15 antibodies directed against Cayman ataxia. For the production of antibody, various host 
animals can be immunized by injection with the peptide corresponding to the Cayman 
ataxia epitope including but not limited to rabbits, mice, rats, sheep, goats, etc. In a 
preferred embodiment, the peptide is conjugated to an immunogenic carrier (e.g., 
diphtheria toxoid, bovine serum albumin (BSA), or keyhole limpet hemocyanin (KLH)). 

20 Various adjuvants may be used to increase the immunological response, depending on the 
host species, including but not limited to Freund's (complete and incomplete), mineral 
gels (e.g., aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, 
and potentially useful human adjuvants such as BCG (Bacille Calmette-Guerin) and 

25 Corynebacterium parvum). 

For preparation of monoclonal antibodies directed toward Cayman ataxia, it is 
contemplated that any technique that provides for the production of antibody molecules 
by continuous cell lines in culture will find use with the present invention (See e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual Cold Spring Harbor Laboratory 

30 Press, Cold Spring Harbor, NY). These include but are not limited to the hybridoma 
technique originally developed by Kohler and Milstein (Kohler and Milstein, Nature 
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256:495-497 [1975]), as well as the trioma technique, the human B-cell hybridoma 
technique (See e.g., Kozbor et al, Immunol Tod., 4:72 [1983]), and the EBV-hybridoma 
technique to produce human monoclonal antibodies (Cole et al., in Monoclonal 
Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96 [1985]). 
5 In an additional embodiment of the invention, monoclonal antibodies are 

produced in germ-free animals utilizing technology such as that described in 
PCT/US90/02545). Furthermore, it is contemplated that human antibodies will be 
generated by human hybridomas (Cote et al, Proc. Natl. Acad. Sci. USA 80:2026-2030 
[1983]) or by transforming human B cells with EBV virus in vitro (Cole et al, in 

10 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, pp. 77-96 [1985]). 

In addition, it is contemplated that techniques described for the production of 
single chain antibodies (U.S. Patent 4,946,778; herein incorporated by reference) will 
find use in producing Cayman ataxia specific single chain antibodies. An additional 
embodiment of the invention utilizes the techniques described for the construction of Fab 

15 expression libraries (Huse et al, Science 246:1275-1281 [1989]) to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity for Cayman 
ataxia. 

In other embodiments, the present invention contemplated recombinant antibodies 
or fragments thereof to the proteins of the present invention. Recombinant antibodies 
20 include, but are not limited to, humanized and chimeric antibodies. Methods for 

generating recombinant antibodies are known in the art (See e.g., U.S. Patents 6,180,370 
and 6,277,969 and "Monoclonal Antibodies" H. Zola, BIOS Scientific Publishers Limited 
2000. Springer- Verlay New York, Inc., New York; each of which is herein incorporated 
by reference). 

25 It is contemplated that any technique suitable for producing antibody fragments 

will find use in generating antibody fragments that contain the idiotype (antigen binding 
region) of the antibody molecule. For example, such fragments include but are not 
limited to: F(ab')2 fragment that can be produced by pepsin digestion of the antibody 
molecule; Fab 1 fragments that can be generated by reducing the disulfide bridges of the 

30 F(ab')2 fragment, and Fab fragments that can be generated by treating the antibody 
molecule with papain and a reducing agent. 
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In the production of antibodies, it is contemplated that screening for the desired 
antibody will be accomplished by techniques known in the art {e.g., radioimmunoassay, 
ELISA (enzyme-linked immunosorbant assay), "sandwich" immunoassays, 
immunoradiometric assays, gel diffusion precipitation reactions, immunodiffusion assays, 
5 in situ immunoassays {e.g., using colloidal gold, enzyme or radioisotope labels, for 
example), Western blots, precipitation reactions, agglutination assays {e.g., gel 
agglutination assays, hemagglutination assays, etc.), complement fixation assays, 
immunofluorescence assays, protein A assays, and Immunoelectrophoresis assays, etc. 
In one embodiment, antibody binding is detected by detecting a label on the 

10 primary antibody. In another embodiment, the primary antibody is detected by detecting 
binding of a secondary antibody or reagent to the primary antibody. In a further 
embodiment, the secondary antibody is labeled. Many means are known in the art for 
detecting binding in an immunoassay and are within the scope of the present invention. 
As is well known in the art, the immunogenic peptide should be provided free of the 

15 carrier molecule used in any immunization protocol. For example, if the peptide was 
conjugated to KLH, it may be conjugated to BSA, or used directly, in a screening assay.) 

The foregoing antibodies can be used in methods known in the art relating to 
the localization and structure of Cayman ataxia {e.g., for Western blotting), measuring 
levels thereof in appropriate biological samples, etc. The antibodies can be used to detect 

20 Cayman ataxia in a biological sample from an individual. The biological sample can be a 
biological fluid, such as, but not limited to, blood, serum, plasma, interstitial fluid, urine, 
cerebrospinal fluid, and the like, containing cells. 

The biological samples can then be tested directly for the presence of human 
Cayman ataxia using an appropriate strategy {e.g., ELISA or radioimmunoassay) and 

25 format {e.g., microwells, dipstick {e.g., as described in International Patent Publication 
WO 93/03367), etc. Alternatively, proteins in the sample can be size separated {e.g., by 
polyacrylamide gel electrophoresis (PAGE), in the presence or not of sodium dodecyl 
sulfate (SDS), and the presence of Cayman ataxia detected by immunoblotting (Western 
blotting). Immunoblotting techniques are generally more effective with antibodies 

30 generated against a peptide corresponding to an epitope of a protein, and hence, are 
particularly suited to the present invention. 
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Another method uses antibodies as agents to alter signal transduction. Specific 
antibodies that bind to the binding domains of Cayman ataxia or other proteins involved 
in intracellular signaling can be used to inhibit the interaction between the various 
proteins and their interaction with other ligands. Antibodies that bind to the complex can 
also be used therapeutically to inhibit interactions of the protein complex in the signal 
transduction pathways leading to the various physiological and cellular effects of 
Cayman ataxia. Such antibodies can also be used diagnostically to measure abnormal 
expression of Cayman ataxia, or the aberrant formation of protein complexes, which may 
be indicative of a disease state. 

V. Gene Therapy Using Cayman ataxia 

The present invention also provides methods and compositions suitable for gene 
therapy to alter Cayman ataxia expression, production, or function. As described above, 
the present invention provides human Cayman ataxia genes and provides methods of 
obtaining Cayman ataxia genes from other species. Thus, the methods described below 
are generally applicable across many species. In some embodiments, it is contemplated 
that the gene therapy is performed by providing a subject with a wild-type allele of 
Cayman ataxia an allele that does not contain a Cayman ataxia disease (e.g., free of 
disease causing polymorphisms or mutations). Subjects in need of such therapy are 
identified by the methods described above. 

Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for constructing and 
using viral vectors are known in the art (See e.g., Miller and Rosman, BioTech., 
7:980-990 [1992]). Preferably, the viral vectors are replication defective, that is, they are 
unable to replicate autonomously in the target cell. In general, the genome of the 
replication defective viral vectors that are used within the scope of the present invention 
lack at least one region that is necessary for the replication of the virus in the infected 
cell. These regions can either be eliminated (in whole or in part), or be rendered 
non-functional by any technique known to a person skilled in the art. These techniques 
include the total removal, substitution (by other sequences, in particular by the inserted 
nucleic acid), partial deletion or addition of one or more bases to an essential (for 
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replication) region. Such techniques may be performed in vitro (i.e., on the isolated 
DNA) or in situ, using the techniques of genetic manipulation or by treatment with 
mutagenic agents. 

Preferably, the replication defective virus retains the sequences of its genome that 
5 are necessary for encapsidating the viral particles. DNA viral vectors include an 

attenuated or defective DNA viruses, including, but not limited to, herpes simplex virus 
(HSV), papillomavirus, Epstein Barr virus (EBV), adenovirus, adeno-associated virus 
(AAV), and the like. Defective viruses, that entirely or almost entirely lack viral genes, 
are preferred, as defective virus is not infective after introduction into a cell. Use of 

10 defective viral vectors allows for administration to cells in a specific, localized area, 
without concern that the vector can infect other cells. Thus, a specific tissue can be 
specifically targeted. Examples of particular vectors include, but are not limited to, a 
defective herpes virus 1 (HSV1) vector (Kaplitt et al, Mol. Cell. Neurosci., 2:320-330 
[1991]), defective herpes virus vector lacking a glycoprotein L gene (See e.g., Patent 

15 Publication RD 371005 A), or other defective herpes virus vectors (See e.g., WO 
94/21807; and WO 92/05263); an attenuated adenovirus vector, such as the vector 
described by Stratford-Perricaudet et al (J. Clin. Invest., 90:626-630 [1992]; See also, La 
Salle et al, Science 259:988-990 [1993]); and a defective adeno-associated virus vector 
(Samulski etal, J. Virol., 61:3096-3101 [1987]; Samulski et al, J. Virol., 63:3822-3828 

20 [1989]; and Lebkowski et al, Mol. Cell. Biol., 8:3988-3996 [1988]). 

Preferably, for in vivo administration, an appropriate immunosuppressive 
treatment is employed in conjunction with the viral vector (e.g., adenovirus vector), to 
avoid immuno-deactivation of the viral vector and transfected cells. For example, 
immunosuppressive cytokines, such as interleukin-12 (IL-12), interferon-gamma (IFN-y), 

25 or anti-CD4 antibody, can be administered to block humoral or cellular immune 

responses to the viral vectors. In addition, it is advantageous to employ a viral vector that 
is engineered to express a minimal number of antigens. 

In a preferred embodiment, the vector is an adenovirus vector. Adenoviruses are 
eukaryotic DNA viruses that can be modified to efficiently deliver a nucleic acid of the 

30 invention to a variety of cell types. Various serotypes of adenovirus exist. Of these 

serotypes, preference is given, within the scope of the present invention, to type 2 or type 
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5 human adenoviruses (Ad 2 or Ad 5), or adenoviruses of animal origin (See e.g., WO 
94/26914). Those adenoviruses of animal origin that can be used within the scope of the 
present invention include adenoviruses of canine, bovine, murine (e.g., Mavl, Beard et 
aL, Virol, 75-81 [1990]), ovine, porcine, avian, and simian (e.g., SAV) origin. 
5 Preferably, the adenovirus of animal origin is a canine adenovirus, more preferably a 
CAV2 adenovirus (e.g. Manhattan or A26/61 strain (ATCC VR-800)). 

Preferably, the replication defective adenoviral vectors of the invention comprise 
the ITRs, an encapsidation sequence and the nucleic acid of interest. Still more 
preferably, at least the El region of the adenoviral vector is non-functional. The deletion 

10 in the El region preferably extends from nucleotides 455 to 3329 in the sequence of the 
Ad5 adenovirus (PvulI-BgUI fragment) or 382 to 3446 (Hinfil-Sau3A fragment). Other 
regions may also be modified, in particular the E3 region (e.g., WO 95/02697), the E2 
region (e.g., WO 94/28938), the E4 region (e.g., WO 94/28152, WO 94/12649 and WO 
95/02697), or in any of the late genes L1-L5. 

15 In a preferred embodiment, the adenoviral vector has a deletion in the 

El region (Ad 1.0). Examples of El -deleted adenoviruses are disclosed in EP 185,573, 
the contents of which are incorporated herein by reference. In another preferred 
embodiment, the adenoviral vector has a deletion in the El and E4 regions (Ad 3.0). 
Examples of El/E4-deleted adenoviruses are disclosed in WO 95/02697 and WO 

20 96/22378. In still another preferred embodiment, the adenoviral vector has a deletion in 
the El region into which the E4 region and the nucleic acid sequence are inserted. 

The replication defective recombinant adenoviruses according to the invention 
can be prepared by any technique known to the person skilled in the art (See e.g., Levrero 
etal, Gene 101:195 [1991]; EP 185 573; and Graham, EMBO J., 3:2917 [1984]). In 

25 particular, they can be prepared by homologous recombination between an adenovirus 
and a plasmid that carries, inter alia, the DNA sequence of interest. The homologous 
recombination is accomplished following co-transfection of the adenovirus and plasmid 
into an appropriate cell line. The cell line that is employed should preferably (i) be 
transformable by the elements to be used, and (ii) contain the sequences that are able to 

30 complement the part of the genome of the replication defective adenovirus, preferably in 
integrated form in order to avoid the risks of recombination. Examples of cell lines that 
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may be used are the human embryonic kidney cell line 293 (Graham et ai, J. Gen. Virol., 
36:59 [1977]), which contains the left-hand portion of the genome of an Ad5 adenovirus 
(12%) integrated into its genome, and cell lines that are able to complement the El and 
E4 functions, as described in applications WO 94/26914 and WO 95/02697. 
5 Recombinant adenoviruses are recovered and purified using standard molecular 
biological techniques that are well known to one of ordinary skill in the art. 

The adeno-associated viruses (AAV) are DNA viruses of relatively small size that 
can integrate, in a stable and site-specific manner, into the genome of the cells that they 
infect. They are able to infect a wide spectrum of cells without inducing any effects on 

10 cellular growth, morphology or differentiation, and they do not appear to be involved in 
human pathologies. The AAV genome has been cloned, sequenced and characterized. It 
encompasses approximately 4700 bases and contains an inverted terminal repeat (ITR) 
region of approximately 145 bases at each end, which serves as an origin of replication 
for the virus. The remainder of the genome is divided into two essential regions that 

1 5 carry the encapsidation functions: the left-hand part of the genome, that contains the rep 
gene involved in viral replication and expression of the viral genes; and the right-hand 
part of the genome, that contains the cap gene encoding the capsid proteins of the virus. 

The use of vectors derived from the AAVs for transferring genes in vitro and in 
vivo has been described (See e.g., WO 91/18088; WO 93/09239; US Pat. No. 4,797,368; 

20 US Pat. No., 5,139,941; and EP 488 528, all of which are herein incorporated by 

reference). These publications describe various AAV-derived constructs in which the rep 
and/or cap genes are deleted and replaced by a gene of interest, and the use of these 
constructs for transferring the gene of interest in vitro (into cultured cells) or in vivo 
(directly into an organism). The replication defective recombinant AAVs according to 

25 the invention can be prepared by co-transfecting a plasmid containing the nucleic acid 
sequence of interest flanked by two AAV inverted terminal repeat (ITR) regions, and a 
plasmid carrying the AAV encapsidation genes (rep and cap genes), into a cell line that is 
infected with a human helper virus (for example an adenovirus). The AAV recombinants 
that are produced are then purified by standard techniques. 

30 In another embodiment, the gene can be introduced in a retroviral vector (e.g., as 

described in U.S. Pat. Nos. 5,399,346, 4,650,764, 4,980,289 and 5,124,263; all of which 



63 



are herein incorporated by reference; Mann et al, Cell 33:153 [1983]; Markowitz et al, 
J. Virol, 62:1 120 [1988]; PCT/US95/14575; EP 453242; EP178220; Bernstein et al 
Genet. Eng., 7:235 [1985]; McCormick, BioTechnol., 3:689 [1985]; WO 95/07358; and 
Kuo et al, Blood 82:845 [1993]). The retroviruses are integrating viruses that infect 
dividing cells. The retrovirus genome includes two LTRs, an encapsidation sequence and 
three coding regions (gag, pol and env). In recombinant retroviral vectors, the gag, pol 
and env genes are generally deleted, in whole or in part, and replaced with a heterologous 
nucleic acid sequence of interest. These vectors can be constructed from different types 
of retrovirus, such as, HIV, MoMuLV ("murine Moloney leukemia virus" MSV ("murine 
Moloney sarcoma virus"), HaSV ("Harvey sarcoma virus"); SNV ("spleen necrosis 
virus"); RSV ("Rous sarcoma virus") and Friend virus. Defective retroviral vectors are 
also disclosed in WO 95/02697. 

In general, in order to construct recombinant retroviruses containing a nucleic 
acid sequence, a plasmid is constructed that contains the LTRs, the encapsidation 
sequence and the coding sequence. This construct is used to transfect a packaging cell 
line, which cell line is able to supply in trans the retroviral functions that are deficient in 
the plasmid. In general, the packaging cell lines are thus able to express the gag, pol and 
env genes. Such packaging cell lines have been described in the prior art, in particular 
the cell line PA317 (US Pat. No. 4,861,719, herein incorporated by reference), the 
PsiCRIP cell line (See, WO90/02806), and the GP+envAm-12 cell line (See, 
WO89/07150). In addition, the recombinant retroviral vectors can contain modifications 
within the LTRs for suppressing transcriptional activity as well as extensive 
encapsidation sequences that may include a part of the gag gene (Bender et al, J. Virol., 
61:1639 [1987]). Recombinant retroviral vectors are purified by standard techniques 
known to those having ordinary skill in the art. 

Alternatively, the vector can be introduced in vivo by lipofection. For the past 
decade, there has been increasing use of liposomes for encapsulation and transfection of 
nucleic acids in vitro. Synthetic cationic lipids designed to limit the difficulties and 
dangers encountered with liposome mediated transfection can be used to prepare 
liposomes for in vivo transfection of a gene encoding a marker (Feigner et. al, Proc. Natl. 
Acad. Sci. USA 84:7413-7417 [1987]; See also, Mackey, et al, Proc. Natl. Acad. Sci. 
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USA 85:8027-8031 [1988]; Ulmeref a/., Science 259:1745-1748 [1993]). The use of 
cationic lipids may promote encapsulation of negatively charged nucleic acids, and also 
promote fusion with negatively charged cell membranes (Feigner and Ringold, Science 
337:387-388 [1989]). Particularly useful lipid compounds and compositions for transfer 
5 of nucleic acids are described in W095/18863 and W096/17823, and in U.S. Pat. No. 
5,459,127, herein incorporated by reference. 

Other molecules are also useful for facilitating transfection of a nucleic acid in 
vivo, such as a cationic oligopeptide (eg., W095/21931), peptides derived from DNA 
binding proteins (e.g., WO96/25508), or a cationic polymer (e.g., W095/21931). 

10 It is also possible to introduce the vector in vivo as a naked DNA 

plasmid. Methods for formulating and administering naked DNA to mammalian muscle 
tissue are disclosed in U.S. Pat. Nos. 5,580,859 and 5,589,466, both of which are herein 
incorporated by reference. 

DNA vectors for gene therapy can be introduced into the desired host cells by 

1 5 methods known in the art, including but not limited to transfection, electroporation, 

microinjection, transduction, cell fusion, DEAE dextran, calcium phosphate precipitation, 
use of a gene gun, or use of a DNA vector transporter (See e.g., Wu et ai, J. Biol. Chem., 
267:963 [1992]; Wu and Wu, J. Biol. Chem., 263:14621 [1988]; and Williams etaL, 
Proc. Natl. Acad. Sci. USA 88:2726 [1991]). Receptor-mediated DNA delivery 

20 approaches can also be used (Curiel et al., Hum. Gene Ther., 3:147 [1992]; and Wu and 
Wu, J. Biol. Chem., 262:4429 [1987]). 

VI, Transgenic Animals Expressing Exogenous Cayman ataxia Genes and 
Homologs, Mutants, and Variants Thereof 

25 The present invention contemplates the generation of transgenic animals 

comprising an exogenous Cayman ataxia gene or homologs, mutants, or variants thereof. 
In preferred embodiments, the transgenic animal displays an altered phenotype as 
compared to wild-type animals. In some embodiments, the altered phenotype is the 
overexpression of mRNA for a Cayman ataxia gene as compared to wild-type levels of 

30 Cayman ataxia expression. In other embodiments, the altered phenotype is the decreased 
expression of mRNA for an endogenous Cayman ataxia gene as compared to wild-type 
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levels of endogenous Cayman ataxia expression. In some preferred embodiments, the 
transgenic animals comprise mutant (e.g., truncated) alleles of Cayman ataxia. Methods 
for analyzing the presence or absence of such phenotypes include Northern blotting, 
mRNA protection assays, and RT-PCR. In other embodiments, the transgenic mice have 
5 a knock out mutation of the Cayman ataxia gene. In preferred embodiments, the 
transgenic animals display an ataxia disease phenotype. 

Such animals find use in research applications (e.g., identifying signaling 
pathways that Cayman ataxia is involved in), as well as drug screening applications (e.g., 
to screen for drugs that prevent ataxia. For example, in some embodiments, test 

10 compounds (e.g., a drug that is suspected of being useful to treat ataxia) and control 
compounds (e.g., a placebo) are administered to the transgenic animals and the control 
animals and the effects evaluated. The effects of the test and control compounds on 
disease symptoms are then assessed. 

The transgenic animals can be generated via a variety of methods. In some 

15 embodiments, embryonal cells at various developmental stages are used to introduce 
transgenes for the production of transgenic animals. Different methods are used 
depending on the stage of development of the embryonal cell. The zygote is the best 
target for micro-injection. In the mouse, the male pronucleus reaches the size of 
approximately 20 micrometers in diameter, which allows reproducible injection of 1-2 

20 picoliters (pi) of DNA solution. The use of zygotes as a target for gene transfer has a 

major advantage in that in most cases the injected DNA will be incorporated into the host 
genome before the first cleavage (Brinster et al., Proc. Natl. Acad. Sci. USA 82:4438- 
4442 [1985]). As a consequence, all cells of the transgenic non-human animal will carry 
the incorporated transgene. This will in general also be reflected in the efficient 

25 transmission of the transgene to offspring of the founder since 50% of the germ cells will 
harbor the transgene. U.S. Patent No. 4,873,191 describes a method for the micro- 
injection of zygotes; the disclosure of this patent is incorporated herein in its entirety. 

In other embodiments, retroviral infection is used to introduce transgenes into a 
non-human animal. In some embodiments, the retroviral vector is utilized to transfect 

30 oocytes by injecting the retroviral vector into the perivitelline space of the oocyte (U.S. 
Pat. No. 6,080,912, incorporated herein by reference). In other embodiments, the 
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developing non-human embryo can be cultured in vitro to the blastocyst stage. During 
this time, the blastomeres can be targets for retroviral infection (Janenich, Proc. Natl. 
Acad. Sci. USA 73:1260 [1976]). Efficient infection of the blastomeres is obtained by 
enzymatic treatment to remove the zona pellucida (Hogan et al, in Manipulating the 
5 Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. [1986]). 
The viral vector system used to introduce the transgene is typically a replication-defective 
retrovirus carrying the transgene (Jahner et al.,. Proc. Natl. Acad Sci. USA 82:6927 
[1985]). Transfection is easily and efficiently obtained by culturing the blastomeres on a 
monolayer of virus-producing cells (Van der Putten, supra; Stewart, et aL, EMBO J., 

10 6:383 [1987]). Alternatively, infection can be performed at a later stage. Virus or virus- 
producing cells can be injected into the blastocoele (Jahner et al., Nature 298:623 
[1982]). Most of the founders will be mosaic for the transgene since incorporation occurs 
only in a subset of cells that form the transgenic animal. Further, the founder may 
contain various retroviral insertions of the transgene at different positions in the genome 

15 that generally will segregate in the offspring. In addition, it is also possible to introduce 
transgenes into the germline, albeit with low efficiency, by intrauterine retroviral 
infection of the midgestation embryo (Jahner et al, supra [1982]). Additional means of 
using retroviruses or retroviral vectors to create transgenic animals known to the art 
involves the micro-injection of retroviral particles or mitomycin C-treated cells producing 

20 retrovirus into the perivitelline space of fertilized eggs or early embryos (PCT 

International Application WO 90/08832 [1990], and Haskell and Bowen, Mol. Reprod. 
Dev., 40:386 [1995]). 

In other embodiments, the transgene is introduced into embryonic stem cells and 
the transfected stem cells are utilized to form an embryo. ES cells are obtained by 

25 culturing pre-implantation embryos in vitro under appropriate conditions (Evans et al, 
Nature 292:154 [1981]; Bradley et al., Nature 309:255 [1984]; Gossler et al, Proc. Acad. 
Sci. USA 83:9065 [1986]; and Robertson et al, Nature 322:445 [1986]). Transgenes can 
be efficiently introduced into the ES cells by DNA transfection by a variety of methods 
known to the art including calcium phosphate co-precipitation, protoplast or spheroplast 

30 fusion, lipofection and DEAE-dextran-mediated transfection. Transgenes may also be 
introduced into ES cells by retro virus-mediated transduction or by micro-injection. Such 
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transfected ES cells can thereafter colonize an embryo following their introduction into 
the blastocoel of a blastocyst-stage embryo and contribute to the germ line of the 
resulting chimeric animal (for review, See, Jaenisch, Science 240:1468 [1988]). Prior to 
the introduction of transfected ES cells into the blastocoel, the transfected ES cells may 
5 be subjected to various selection protocols to enrich for ES cells which have integrated 
the transgene assuming that the transgene provides a means for such selection. 
Alternatively, the polymerase chain reaction may be used to screen for ES cells that have 
integrated the transgene. This technique obviates the need for growth of the transfected 
ES cells under appropriate selective conditions prior to transfer into the blastocoel. 
10 In still other embodiments, homologous recombination is utilized to knock-out 

gene function or create deletion mutants (e.g., mutants in which the LRRs of Cayman 
ataxia are deleted). Methods for homologous recombination are described in U.S. Pat. 
No. 5,614,396, incorporated herein by reference. 

1 5 VIII. Drug Screening Using Cayman ataxia 

As described herein, it is contemplated that Cayman ataxia is involved in 
hereditary movement disorders such as Caymans ataxia. Accordingly, in some 
embodiments, the isolated nucleic acid sequences of Cayman ataxia (e.g., SEQ ED NOs: 
3, 1 1, 8, and 10) are used in drug screening applications for compounds that are useful in 
20 the treatment of ataxia and related disorders. 

A, Identification of Binding Partners 

In some embodiments, binding partners of Cayman ataxia amino acids are 
identified. In some embodiments, the Cayman ataxia nucleic acid sequences (e.g., SEQ 

25 ID NOs: 3, 1 1, 8, and 10) or fragments thereof are used in yeast two-hybrid screening 
assays. For example, in some embodiments, the nucleic acid sequences are subcloned 
into pGPT9 (Clontech, La Jolla, CA) to be used as a bait in a yeast-2-hybrid screen for 
protein-protein interaction of a human fetal kidney cDNA library (Fields and Song 
Nature 340:245 -246, 1989; herein incorporated by reference). In other embodiments, 

30 phage display is used to identify binding partners (Parmley and Smith Gene 73 : 305-3 1 8, 
[1988]; herein incorporated by reference). 
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B. Drug Screening 

The present invention provides methods and compositions for using Cayman 
ataxia as a target for screening drugs that can alter, for example, interaction between 
5 Cayman ataxia and Cayman ataxia binding partners (e.g., those identified using the above 
methods) 

In one screening method, the two-hybrid system is used to screen for compounds 
(e.g., drug) capable of altering (e.g., inhibiting) Cayman ataxia function(s) (e.g., 
interaction with a binding partner) in vitro or in vivo. In one embodiment, a GAL4 

10 binding site, linked to a reporter gene such as lacZ, is contacted in the presence and 
absence of a candidate compound with a GAL4 binding domain linked to a Cayman 
ataxia fragment and a GAL4 transactivation domain II linked to a binding partner 
fragment. Expression of the reporter gene is monitored and a decrease in the expression 
is an indication that the candidate compound inhibits the interaction of Cayman ataxia 

15 with the binding partner. Alternately, the effect of candidate compounds on the 

interaction of Cayman ataxia with other proteins (e.g., proteins known to interact directly 
or indirectly with the binding partner) can be tested in a similar manner. 

In another screening method, candidate compounds are evaluated for their ability 
to alter Cayman ataxia signaling by contacting Cayman ataxia, binding partners, binding 

20 partner-associated proteins, or fragments thereof, with the candidate compound and 
determining binding of the candidate compound to the peptide. The protein or protein 
fragments is/are immobilized using methods known in the art such as binding a GST- 
Cayman ataxia fusion protein to a polymeric bead containing glutathione. A chimeric 
gene encoding a GST fusion protein is constructed by fusing DNA encoding the 

25 polypeptide or polypeptide fragment of interest to the DNA encoding the carboxyl 

terminus of GST (See e.g., Smith et al, Gene 67:31 [1988]). The fusion construct is then 
transformed into a suitable expression system (e.g., E. coli XA90) in which the 
expression of the GST fusion protein can be induced with isopropyl-p-D- 
thiogalactopyranoside (IPTG). Induction with IPTG should yield the fusion protein as a 

30 major constituent of soluble, cellular proteins. The fusion proteins can be purified by 
methods known to those skilled in the art, including purification by glutathione affinity 
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chromatography. Binding of the candidate compound to the proteins or protein 
fragments is correlated with the ability of the compound to disrupt the signal transduction 
pathway and thus regulate Cayman ataxia physiological effects (e.g., ataxia). 
In another screening method, one of the components of the Cayman 
5 ataxia/binding partner signaling system, is immobilized. Polypeptides can be 

immobilized using methods known in the art, such as adsorption onto a plastic microtiter 
plate or specific binding of a GST-fusion protein to a polymeric bead containing 
glutathione. For example, GST-Cayman ataxia is bound to glutathione-Sepharose beads. 
The immobilized peptide is then contacted with another peptide with which it is capable 

1 0 of binding in the presence and absence of a candidate compound. Unbound peptide is 
then removed and the complex solubilized and analyzed to determine the amount of 
bound labeled peptide. A decrease in binding is an indication that the candidate 
compound inhibits the interaction of Cayman ataxia with the other peptide. A variation 
of this method allows for the screening of compounds that are capable of disrupting a 

15 previously- formed protein/protein complex. For example, in some embodiments a 
complex comprising Cayman ataxia or a Cayman ataxia fragment bound to another 
peptide is immobilized as described above and contacted with a candidate compound. 
The dissolution of the complex by the candidate compound correlates with the ability of 
the compound to disrupt or inhibit the interaction between Cayman ataxia and the other 

20 peptide. 

Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to Cayman ataxia peptides and is described in 
detail in WO 84/03564, incorporated herein by reference. Briefly, large numbers of 
different small peptide test compounds are synthesized on a solid substrate, such as 

25 plastic pins or some other surface. The peptide test compounds are then reacted with 

Cayman ataxia peptides and washed. Bound Cayman ataxia peptides are then detected by 
methods well known in the art. 

Another technique uses Cayman ataxia antibodies, generated as discussed above. 
Such antibodies capable of specifically binding to Cayman ataxia peptides compete with 

30 a test compound for binding to Cayman ataxia. In this manner, the antibodies can be 
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used to detect the presence of any peptide that shares one or more antigenic determinants 
of the Cayman ataxia peptide. 

The present invention contemplates many other means of screening compounds. 
The examples provided above are presented merely to illustrate a range of techniques 
5 available. One of ordinary skill in the art will appreciate that many other screening 
methods can be used. 

In particular, the present invention contemplates the use of cell lines transfected 
with Cayman ataxia and variants thereof for screening compounds for activity, and in 
particular to high throughput screening of compounds from combinatorial libraries (e.g., 

10 libraries containing greater than 10 4 compounds). The cell lines of the present invention 
can be used in a variety of screening methods. In some embodiments, the cells can be 
used in second messenger assays that monitor signal transduction following activation of 
cell-surface receptors. In other embodiments, the cells can be used in reporter gene 
assays that monitor cellular responses at the transcription/translation level. In still further 

15 embodiments, the cells can be used in cell proliferation assays to monitor the overall 
growth/no growth response of cells to external stimuli. 

In second messenger assays, the host cells are preferably transfected as described 
above with vectors encoding Cayman ataxia or variants or mutants thereof. The host 
cells are then treated with a compound or plurality of compounds (e.g., from a 

20 combinatorial library) and assayed for the presence or absence of a response. It is 

contemplated that at least some of the compounds in the combinatorial library can serve 
as agonists, antagonists, activators, or inhibitors of the protein or proteins encoded by the 
vectors. It is also contemplated that at least some of the compounds in the combinatorial 
library can serve as agonists, antagonists, activators, or inhibitors of protein acting 

25 upstream or downstream of the protein encoded by the vector in a signal transduction 
pathway. 

In some embodiments, the second messenger assays measure fluorescent signals 
from reporter molecules that respond to intracellular changes (e.g., Ca 2+ concentration, 
membrane potential, pH, IP3, cAMP, arachidonic acid release) due to stimulation of 
30 membrane receptors and ion channels (e.g., ligand gated ion channels; see Denyer et al, 
Drug Discov. Today 3:323 [1998]; and Gonzales et al, Drug. Discov. Today 4:431-39 
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[1999]). Examples of reporter molecules include, but are not limited to, FRET 
(florescence resonance energy transfer) systems (e.g., Cuo-lipids and oxonols, 
EDAN/DABCYL), calcium sensitive indicators (e.g., Fluo-3, FURA 2, INDO 1, and 
FLU03/AM, BAPTA AM), chloride-sensitive indicators (e.g., SPQ, SPA), potassium- 
5 sensitive indicators (e.g., PBFI), sodium-sensitive indicators (e.g., SBFI), and pH 
sensitive indicators (e.g., BCECF). 

In general, the host cells are loaded with the indicator prior to exposure to the 
compound. Responses of the host cells to treatment with the compounds can be detected 
by methods known in the art, including, but not limited to, fluorescence microscopy, 

10 confocal microscopy (e.g., FCS systems), flow cytometry, microfluidic devices, FLIPR 
systems (See, e.g., Schroeder and Neagle, J. Biomol. Screening 1 :75 [1996]), and plate- 
reading systems. In some preferred embodiments, the response (e.g., increase in 
fluorescent intensity) caused by compound of unknown activity is compared to the 
response generated by a known agonist and expressed as a percentage of the maximal 

15 response of the known agonist. The maximum response caused by a known agonist is 
defined as a 100% response. Likewise, the maximal response recorded after addition of 
an agonist to a sample containing a known or test antagonist is detectably lower than the 
100% response. 

The cells are also useful in reporter gene assays. Reporter gene assays involve the 
20 use of host cells transfected with vectors encoding a nucleic acid comprising 

transcriptional control elements of a target gene (i.e., a gene that controls the biological 
expression and function of a disease target) spliced to a coding sequence for a reporter 
gene. Therefore, activation of the target gene results in activation of the reporter gene 
product. In some embodiments, the reporter gene construct comprises the 5 f regulatory 
25 region (e.g., promoters and/or enhancers) of a protein whose expression is controlled by 
Cayman ataxia in operable association with a reporter gene. Examples of reporter genes 
finding use in the present invention include, but are not limited to, chloramphenicol 
transferase, alkaline phosphatase, firefly and bacterial luciferases, p-galactosidase, P~ 
lactamase, and green fluorescent protein. The production of these proteins, with the 
30 exception of green fluorescent protein, is detected through the use of chemiluminescent, 
colorimetric, or bioluminecent products of specific substrates (e.g., X-gal and luciferin). 
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Comparisons between compounds of known and unknown activities may be conducted as 
described above. 

Specifically, the present invention provides screening methods for identifying 
modulators, i.e., candidate or test compounds or agents {e.g., proteins, peptides, 
5 peptidomimetics, peptoids, small molecules or other drugs) which bind to Cayman ataxia 
of the present invention, have an inhibitory (or stimulatory) effect on, for example, 
Cayman ataxia expression or Cayman ataxia activity, or have a stimulatory or inhibitory 
effect on, for example, the expression or activity of a Cayman ataxia substrate. 
Compounds thus identified can be used to modulate the activity of target gene products 
10 {e.g., Cayman ataxia genes) either directly or indirectly in a therapeutic protocol, to 

elaborate the biological function of the target gene product, or to identify compounds that 
disrupt normal target gene interactions. Compounds that stimulate the activity of a 
variant Cayman ataxia or mimic the activity of a non- functional variant are particularly 
useful in the treatment of ataxia {e.g., Caymans ataxia). 
15 In one embodiment, the invention provides assays for screening candidate or test 

compounds that are substrates of a Cayman ataxia protein or polypeptide or a biologically 
active portion thereof. In another embodiment, the invention provides assays for 
screening candidate or test compounds that bind to or modulate the activity of a Cayman 
ataxia protein or polypeptide or a biologically active portion thereof. 
20 The test compounds of the present invention can be obtained using any of the 

numerous approaches in combinatorial library methods known in the art, including 
biological libraries; peptoid libraries (libraries of molecules having the functionalities of 
peptides, but with a novel, non-peptide backbone, which are resistant to enzymatic 
degradation but which nevertheless remain bioactive; see, e.g., Zuckennann et al, J. Med. 
25 Chem. 37: 2678-85 [1994]); spatially addressable parallel solid phase or solution phase 
libraries; synthetic library methods requiring deconvolution; the 'one-bead one- 
compound' library method; and synthetic library methods using affinity chromatography 
selection. The biological library and peptoid library approaches are preferred for use 
with peptide libraries, while the other four approaches are applicable to peptide, non- 
30 peptide oligomer or small molecule libraries of compounds (Lam (1997) Anticancer Drug 
Des. 12:145). 
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Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al, Proc. Natl. Acad. Sci. U.S.A. 90:6909 [1993]; Erb et 
al, Proc. Nad. Acad. Sci. USA 91:1 1422 [1994]; Zuckermann et al, J. Med. Chem. 
37:2678 [1994]; Cho et al, Science 261:1303 [1993]; Carrell et al, Angew. Chem. Int. 
5 Ed. Engl. 33.2059 [1994]; Carell et al, Angew. Chem. Int. Ed. Engl. 33:2061 [1994]; and 
Gallop etal, J. Med. Chem. 37:1233 [1994]. 

Libraries of compounds may be presented in solution {e.g., Houghten, 
Biotechniques 13:412-421 [1992]), or on beads (Lam, Nature 354:82-84 [1991]), chips 
(Fodor, Nature 364:555-556 [1993]), bacteria or spores (U.S. Patent No. 5,223,409; 

10 herein incorporated by reference), plasmids (Cull et al, Proc. Nad. Acad. Sci. USA 

89:18651869 [1992]) or on phage (Scott and Smith, Science 249:386-390 [1990]; Devlin 
Science 249:404-406 [1990]; Cwirla et al, Proc. Natl. Acad. Sci. 87:6378-6382 [1990]; 
Felici, J. Mol. Biol. 222:301 [1991]). 

In one embodiment, an assay is a cell-based assay in which a cell that expresses a 

1 5 Cayman ataxia protein or biologically active portion thereof is contacted with a test 

compound, and the ability of the test compound to the modulate Cayman ataxia's activity 
is determined. Determining the ability of the test compound to modulate Cayman ataxia 
activity can be accomplished by monitoring, for example, changes in enzymatic activity. 
The cell, for example, can be of mammalian origin. 

20 The ability of the test compound to modulate Cayman ataxia binding to a 

compound, e.g., a Cayman ataxia substrate, can also be evaluated. This can be 
accomplished, for example, by coupling the compound, e.g., the substrate, with a 
radioisotope or enzymatic label such that binding of the compound, e.g., the substrate, to 
a Cayman ataxia can be determined by detecting the labeled compound, e.g., substrate, in 

25 a complex. 

Alternatively, the Cayman ataxia is coupled with a radioisotope or enzymatic 
label to monitor the ability of a test compound to modulate Cayman ataxia binding to a 
Cayman ataxia substrate in a complex. For example, compounds {e.g., substrates) can be 
labeled with 125 1, 35 S l4 C or 3 H, either directly or indirectly, and the radioisotope detected 
30 by direct counting of radioemmission or by scintillation counting. Alternatively, 
compounds can be enzymatically labeled with, for example, horseradish peroxidase, 
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alkaline phosphatase, or luciferase, and the enzymatic label detected by determination of 
conversion of an appropriate substrate to product. 

The ability of a compound (e.g., a Cayman ataxia substrate) to interact with a 
Cayman ataxia with or without the labeling of any of the interactants can be evaluated. 
5 For example, a microphysiorneter can be used to detect the interaction of a compound 
with a Cayman ataxia without the labeling of either the compound or the Cayman ataxia 
(McConnell etal Science 257:1906-1912 [1992]). As used herein, a "microphysiorneter" 
(e.g., Cytosensor) is an analytical instrument that measures the rate at which a cell 
acidifies its environment using a light-addressable potentiometric sensor (LAPS). 

10 Changes in this acidification rate can be used as an indicator of the interaction between a 
compound and Cayman ataxia. 

In yet another embodiment, a cell-free assay is provided in which a Cayman 
ataxia protein or biologically active portion thereof is contacted with a test compound and 
the ability of the test compound to bind to the Cayman ataxia protein or biologically 

1 5 active portion thereof is evaluated. Preferred biologically active portions of the Cayman 
ataxia proteins to be used in assays of the present invention include fragments that 
participate in interactions with substrates or other proteins, e.g., fragments with high 
surface probability scores. 

Cell-free assays involve preparing a reaction mixture of the target gene protein 

20 and the test compound under conditions and for a time sufficient to allow the two 

components to interact and bind, thus forming a complex that can be removed and/or 
detected. 

The interaction between two molecules can also be detected, e.g., using 
fluorescence energy transfer (FRET) (see, for example, Lakowicz et al, U.S. Patent No. 
25 5,631,169; Stavrianopoulos et al, U.S. Patent No. 4,968,103; each of which is herein 
incorporated by reference). A fluorophore label is selected such that a first donor 
molecule's emitted fluorescent energy will be absorbed by a fluorescent label on a 
second, 'acceptor 1 molecule, which in turn is able to fluoresce due to the absorbed energy. 
Alternately, the 'donor' protein molecule may simply utilize the natural 
30 fluorescent energy of tryptophan residues. Labels are chosen that emit different 

wavelengths of light, such that the 'acceptor' molecule label may be differentiated from 
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that of the 'donor 1 . Since the efficiency of energy transfer between the labels is related to 
the distance separating the molecules, the spatial relationship between the molecules can 
be assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in 1 5 the assay should be maximal. An FRET 
5 binding event can be conveniently measured through standard fluorometric detection 
means well known in the art (e.g., using a fluorimeter). 

In another embodiment, determining the ability of the Cayman ataxia protein to 
bind to a target molecule can be accomplished using real-time Biomolecular Interaction 
Analysis (BIA) (see, e.g., Sjolander and Urbaniczky, Anal. Chem. 63:2338-2345 [1991] 

10 and Szabo et al. Curr. Opin. Struct. Biol. 5:699-705 [1995]). "Surface plasmon 

resonance" or "BIA" detects biospecific interactions in real time, without labeling any of 
the interactants (e.g., BlAcore). Changes in the mass at the binding surface (indicative of 
a binding event) result in alterations of the refractive index of light near the surface (the 
optical phenomenon of surface plasmon resonance (SPR)), resulting in a detectable signal 

15 that can be used as an indication of real-time reactions between biological molecules. 

In one embodiment, the target gene product or the test substance is anchored onto 
a solid phase. The target gene product/test compound complexes anchored on the solid 
phase can be detected at the end of the reaction. Preferably, the target gene product can 
be anchored onto a solid surface, and the test compound, (which is not anchored), can be 

20 labeled, either directly or indirectly, with detectable labels discussed herein. 

It may be desirable to immobilize Cayman ataxia, an anti-Cayman ataxia antibody 
or its target molecule to facilitate separation of complexed from non-complexed forms of 
one or both of the proteins, as well as to accommodate automation of the assay. Binding 
of a test compound to a Cayman ataxia protein, or interaction of a Cayman ataxia protein 

25 with a target molecule in the presence and absence of a candidate compound, can be 
accomplished in any vessel suitable for containing the reactants. Examples of such 
vessels include microtiter plates, test tubes, and micro-centrifuge tubes. In one 
embodiment, a fusion protein can be provided which adds a domain that allows one or 
both of the proteins to be bound to a matrix. For example, glutathione-S-transferase- 

30 Cayman ataxia fusion proteins or glutathione-S-transferase/target fusion proteins can be 
adsorbed onto glutathione Sepharose beads (Sigma Chemical, St. Louis, MO) or 
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glutathione-derivatized microtiter plates, which are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or Cayman 
ataxia protein, and the mixture incubated under conditions conducive for complex 
formation (e.g., at physiological conditions for salt and pH). Following incubation, the 
5 beads or microtiter plate wells are washed to remove any unbound components, the 

matrix immobilized in the case of beads, complex determined either directly or indirectly, 
for example, as described above. 

Alternatively, the complexes can be dissociated from the matrix, and the level of 
Cayman ataxia binding or activity determined using standard techniques. Other 

10 techniques for immobilizing either Cayman ataxia protein or a target molecule on 

matrices include using conjugation of biotin and streptavidin. Biotinylated Cayman ataxia 
protein or target molecules can be prepared from biotin-NHS (N-hydroxy-succinimide) 
using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 
EL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 

15 Chemical). 

In order to conduct the assay, the non-immobilized component is added to the 
coated surface containing the anchored component. After the reaction is complete, 
unreacted components are removed (e.g., by washing) under conditions such that any 
complexes formed will remain immobilized on the solid surface. The detection of 

20 complexes anchored on the solid surface can be accomplished in a number of ways. 

Where the previously non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the previously 
non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 

25 immobilized component (the antibody, in turn, can be directly labeled or indirectly 
labeled with, e.g., a labeled anti-IgG antibody). 

This assay is performed utilizing antibodies reactive with Cayman ataxia protein 
or target molecules but which do not interfere with binding of the Cayman ataxia protein 
to its target molecule. Such antibodies can be derivatized to the wells of the plate, and 

30 unbound target or Cayman ataxia protein trapped in the wells by antibody conjugation. 
Methods for detecting such complexes, in addition to those described above for the GST- 
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immobilized complexes, include immunodetection of complexes using antibodies 
reactive with the Cayman ataxia protein or target molecule, as well as enzyme-linked 
assays which rely on detecting an enzymatic activity associated with the Cayman ataxia 
protein or target molecule. 
5 Alternatively, cell free assays can be conducted in a liquid phase. In such an 

assay, the reaction products are separated from unreacted components, by any of a 
number of standard techniques, including, but not limited to: differential centrifugation 
(see, for example, Rivas and Minton, Trends Biochem Sci 18:284-7 [1993]); 
chromatography (gel filtration chromatography, ion-exchange chromatography); 

10 electrophoresis (see, e.g., Ausubel et al, eds. Current Protocols in Molecular Biology 
1999, J. Wiley: New York.); and immunoprecipitation (see, for example, Ausubel et al, 
eds. Current Protocols in Molecular Biology 1999, J. Wiley: New York). Such resins and 
chromatographic techniques are known to one skilled in the art (See e.g., Heegaard J. 
Mol. Recognit 1 1:141-8 [1998]; Hageand Tweed J. Chromatogr. Biomed. Sci. Appl 

15 699:499-525 [1997]). Further, fluorescence energy transfer may also be conveniently 
utilized, as described herein, to detect binding without further purification of the complex 
from solution. 

The assay can include contacting the Cayman ataxia protein or biologically active 
portion thereof with a known compound that binds the Cayman ataxia to form an assay 
20 mixture, contacting the assay mixture with a test compound, and determining the ability 
of the test compound to interact with a Cayman ataxia protein, wherein determining the 
ability of the test compound to interact with a Cayman ataxia protein includes 
determining the ability of the test compound to preferentially bind to Cayman ataxia or 
biologically active portion thereof, or to modulate the activity of a target molecule, as 
25 compared to the known compound. 

To the extent that Cayman ataxia can, in vivo, interact with one or more cellular 
or extracellular macromolecules, such as proteins, inhibitors of such an interaction are 
useful. A homogeneous assay can be used can be used to identify inhibitors. 

For example, a preformed complex of the target gene product and the interactive 
30 cellular or extracellular binding partner product is prepared such that either the target 

gene products or their binding partners are labeled, but the signal generated by the label is 
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quenched due to complex formation (see, e.g., U.S. Patent No. 4,109,496, herein 
incorporated by reference, that utilizes this approach for immunoassays). The addition of 
a test substance that competes with and displaces one of the species from the preformed 
complex will result in the generation of a signal above background. In this way, test 
substances that disrupt target gene product-binding partner interaction can be identified. 
Alternatively, Cayman ataxia protein can be used as a "bait protein" in a two-hybrid assay 
or three-hybrid assay {see, e.g., U.S. Patent No. 5,283,317; Zervos et al, Cell 72:223-232 
[1993]; Madura et al, J. Biol. Chem. 268.12046-12054 [1993]; Bartel et al, 
Biotechniques 14:920-924 [1993]; Iwabuchi et al, Oncogene 8:1693-1696 [1993]; and 
Brent WO 94/10300; each of which is herein incorporated by reference), to identify other 
proteins, that bind to or interact with Cayman ataxia ("Cayman ataxia-binding proteins" 
or "Cayman ataxia-bp") and are involved in Cayman ataxia activity. Such Cayman 
ataxia-bps can be activators or inhibitors of signals by the Cayman ataxia proteins or 
targets as, for example, downstream elements of a Cayman ataxia-mediated signaling 
pathway. 

Modulators of Cayman ataxia expression can also be identified. For example, a 
cell or cell free mixture is contacted with a candidate compound and the expression of 
Cayman ataxia mRNA or protein evaluated relative to the level of expression of Cayman 
ataxia mRNA or protein in the absence of the candidate compound. When expression of 
Cayman ataxia mRNA or protein is greater in the presence of the candidate compound 
than in its absence, the candidate compound is identified as a stimulator of Cayman 
ataxia mRNA or protein expression. Alternatively, when expression of Cayman ataxia 
mRNA or protein is less (i.e., statistically significantly less) in the presence of the 
candidate compound than in its absence, the candidate compound is identified as an 
inhibitor of Cayman ataxia mRNA or protein expression. The level of Cayman ataxia 
mRNA or protein expression can be determined by methods described herein for 
detecting Cayman ataxia mRNA or protein. 

A modulating agent can be identified using a cell-based or a cell free assay, and 
the ability of the agent to modulate the activity of a Cayman ataxia protein can be 
confirmed in vivo, e.g., in an animal such as an animal model for a disease {e.g., an 
animal with ataxia). 
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C. Therapeutic Agents 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
5 agent identified as described herein (e.g., a Cayman ataxia modulating agent or mimetic, 
a Cayman ataxia specific antibody, or a Cayman ataxia-binding partner) in an appropriate 
animal model (such as those described herein) to determine the efficacy, toxicity, side 
effects, or mechanism of action, of treatment with such an agent. Furthermore, novel 
agents identified by the above-described screening assays can be, e.g., used for treatments 
10 of ataxia (e.g., including, but not limited to, Caymans ataxia). 

IX. Pharmaceutical Compositions Containing Cayman ataxia Nucleic Acid, 
Peptides, and Analogs 

The present invention further provides pharmaceutical compositions which may 

1 5 comprise all or portions of Cayman ataxia polynucleotide sequences, Cayman ataxia 

polypeptides, inhibitors or antagonists of Cayman ataxia bioactivity, including antibodies, 
alone or in combination with at least one other agent, such as a stabilizing compound, and 
may be administered in any sterile, biocompatible pharmaceutical carrier, including, but 
not limited to, saline, buffered saline, dextrose, and water. 

20 The methods of the present invention find use in treating diseases or altering 

physiological states characterized by mutant Cayman ataxia alleles (e.g., Cayman ataxia). 
Peptides can be administered to the patient intravenously in a pharmaceutically 
acceptable carrier such as physiological saline. Standard methods for intracellular 
delivery of peptides can be used (e.g., delivery via liposome). Such methods are well 

25 known to those of ordinary skill in the art. The formulations of this invention are useful 
for parenteral administration, such as intravenous, subcutaneous, intramuscular, and 
intraperitoneal. Therapeutic administration of a polypeptide intracellularly can also be 
accomplished using gene therapy as described above. 

As is well known in the medical arts, dosages for any one patient depends upon 

30 many factors, including the patient's size, body surface area, age, the particular compound 
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to be administered, sex, time and route of administration, general health, and interaction 
with other drugs being concurrently administered. 

Accordingly, in some embodiments of the present invention, Cayman ataxia 
nucleotide and Cayman ataxia amino acid sequences can be administered to a patient 
5 alone, or in combination with other nucleotide sequences, drugs or hormones or in 
pharmaceutical compositions where it is mixed with excipient(s) or other 
pharmaceutically acceptable carriers. In one embodiment of the present invention, the 
pharmaceutically acceptable carrier is pharmaceutically inert. In another embodiment of 
the present invention, Cayman ataxia polynucleotide sequences or Cayman ataxia amino 
10 acid sequences may be administered alone to individuals subject to or suffering from a 
disease. 

Depending on the condition being treated, these pharmaceutical compositions 
may be formulated and administered systemically or locally. Techniques for formulation 
and administration may be found in the latest edition of "Remington's Pharmaceutical 
15 Sciences" (Mack Publishing Co, Easton Pa.). Suitable routes may, for example, include 
oral or transmucosal administration; as well as parenteral delivery, including 
intramuscular, subcutaneous, intramedullary, intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. 

For injection, the pharmaceutical compositions of the invention may be 
20 formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks' solution, Ringer's solution, or physiologically buffered saline. For tissue or 
cellular administration, penetrants appropriate to the particular barrier to be permeated 
are used in the formulation. Such penetrants are generally known in the art. 

In other embodiments, the pharmaceutical compositions of the present invention 
25 can be formulated using pharmaceutically acceptable carriers well known in the art in 
dosages suitable for oral administration. Such carriers enable the pharmaceutical 
compositions to be formulated as tablets, pills, capsules, liquids, gels, syrups, slurries, 
suspensions and the like, for oral or nasal ingestion by a patient to be treated. 

Pharmaceutical compositions suitable for use in the present invention include 
30 compositions wherein the active ingredients are contained in an effective amount to 

achieve the intended purpose. For example, an effective amount of Cayman ataxia may 
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be that amount that suppresses apoptosis. Determination of effective amounts is well 
within the capability of those skilled in the art, especially in light of the disclosure 
provided herein. 

In addition to the active ingredients these pharmaceutical compositions may 
5 contain suitable pharmaceutical^ acceptable carriers comprising excipients and 

auxiliaries that facilitate processing of the active compounds into preparations that can be 
used pharmaceutically. The preparations formulated for oral administration may be in 
the form of tablets, dragees, capsules, or solutions. 

The pharmaceutical compositions of the present invention may be manufactured 

10 in a manner that is itself known (e.g. , by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes). 

Pharmaceutical formulations for parenteral administration include aqueous 
solutions of the active compounds in water-soluble form. Additionally, suspensions of 

15 the active compounds may be prepared as appropriate oily injection suspensions. 

Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection 
suspensions may contain substances that increase the viscosity of the suspension, such as 
sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may 

20 also contain suitable stabilizers or agents that increase the solubility of the compounds to 
allow for the preparation of highly concentrated solutions. 

Pharmaceutical preparations for oral use can be obtained by combining the active 
compounds with solid excipient, optionally grinding a resulting mixture, and processing 
the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or 

25 dragee cores. Suitable excipients are carbohydrate or protein fillers such as sugars, 

including lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, etc; 
cellulose such as methyl cellulose, hydroxypropylmethyl-cellulose, or sodium 
carboxymethylcellulose; and gums including arabic and tragacanth; and proteins such as 
gelatin and collagen. If desired, disintegrating or solubilizing agents may be added, such 

30 as the cross-linked polyvinyl pyrrolidone, agar, alginic acid or a salt thereof such as 
sodium alginate. 
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Dragee cores are provided with suitable coatings such as concentrated sugar 
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic 
solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or 
5 dragee coatings for product identification or to characterize the quantity of active 
compound, (i.e., dosage). 

Pharmaceutical preparations that can be used orally include push-fit capsules 
made of gelatin, as well as soft, sealed capsules made of gelatin and a coating such as 
glycerol or sorbitol. The push-fit capsules can contain the active ingredients mixed with 

10 a filler or binders such as lactose or starches, lubricants such as talc or magnesium 
stearate, and, optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid 
polyethylene glycol with or without stabilizers. 

Compositions comprising a compound of the invention formulated in a 

1 5 pharmaceutical acceptable carrier may be prepared, placed in an appropriate container, 
and labeled for treatment of an indicated condition. For polynucleotide or amino acid 
sequences of Cayman ataxia, conditions indicated on the label may include treatment of 
condition related to apoptosis. 

The pharmaceutical composition may be provided as a salt and can be formed 

20 with many acids, including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, succinic, etc. Salts tend to be more soluble in aqueous or other protonic solvents 
that are the corresponding free base forms. In other cases, the preferred preparation may 
be a lyophilized powder in 1 mM-50 mM histidine, 0.1%-2% sucrose, 2%-7% mannitol 
at a pH range of 4.5 to 5.5 that is combined with buffer prior to use. 

25 For any compound used in the method of the invention, the therapeutically 

effective dose can be estimated initially from cell culture assays. Then, preferably, 
dosage can be formulated in animal models (particularly murine models) to achieve a 
desirable circulating concentration range that adjusts Cayman ataxia levels. 

A therapeutically effective dose refers to that amount of Cayman ataxia that 

30 ameliorates symptoms of the disease state. Toxicity and therapeutic efficacy of such 
compounds can be determined by standard pharmaceutical procedures in cell cultures or 
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experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
population) and the ED50 (the dose therapeutically effective in 50% of the population). 

The dose ratio between toxic and therapeutic effects is the therapeutic index, and it can be 
expressed as the ratio LD50/ED50. Compounds that exhibit large therapeutic indices are 
preferred. The data obtained from these cell culture assays and additional animal studies 
can be used in formulating a range of dosage for human use. The dosage of such 
compounds lies preferably within a range of circulating concentrations that include the 
ED50 with little or no toxicity. The dosage varies within this range depending upon the 

dosage form employed, sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of the patient to be 
treated. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Additional factors which may be taken into 
account include the severity of the disease state; age, weight, and gender of the patient; 
diet, time and frequency of administration, drug combination(s), reaction sensitivities, 
and tolerance/response to therapy. Long acting pharmaceutical compositions might be 
administered every 3 to 4 days, every week, or once every two weeks depending on 
half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total 
dose of about 1 g, depending upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature (See, U.S. Pat. Nos. 
4,657,760; 5,206,344; or 5,225,212, all of which are herein incorporated by reference). 
Those skilled in the art will employ different formulations for Cayman ataxia than for the 
inhibitors of Cayman ataxia. Administration to the bone marrow may necessitate 
delivery in a manner different from intravenous injections. 

EXPERIMENTAL 

The following examples are provided in order to demonstrate and further illustrate 
certain preferred embodiments and aspects of the present invention and are not to be 
construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: 
eq (equivalents); M (Molar); nM (micromolar); N (Normal); mol (moles); mmol 
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(millimoles); ^mol (micromoles); nmol (nanomoles); g (grams); mg (milligrams); ^g 
(micrograms); ng (nanograms); 1 or L (liters); ml (milliliters); nl (microliters); cm 
(centimeters); mm (millimeters); ^m (micrometers); nm (nanometers); °C (degrees 
Centigrade); U (units), mU (milliunits); min. (minutes); sec. (seconds); % (percent); kb 
5 (kilobase); bp (base pair); PCR (polymerase chain reaction); BSA (bovine serum 
albumin); Fisher (Fisher Scientific, Pittsburgh, PA); Sigma (Sigma Chemical Co., St. 
Louis, MO.); Promega (Promega Corp., Madison, WI); Perkin-Elmer (Perkin- 
Elmer/Applied Biosystems, Foster City, CA); Boehringer Mannheim (Boehringer 
Mannheim, Corp., Indianapolis, IN); Clonetech (Clonetech, Palo Alto, CA); Qiagen 
10 (Qiagen, Santa Clarita, CA); Stratagene (Stratagene Inc., La Jolla, CA); National 
Biosciences (National Biosciences Inc, Plymouth Minn.) and NEB (New England 
Biolabs, Beverly, MA), wt (wild-type); Ab (antibody). 

Example 1 
1 5 Genetic mapping of j ittery/h esitan t 

Both jittery and hesitant were genetically mapped in ¥2%ji in a large (>3000 
animals, >6000 meioses) intersubspecific cross with Mus musculus castaneus (strain 
CASA/Rk), hes in a small cross of (C3H-HeJ-/iey x C57BL/6J) F2. After analyzing more 
than 6000 meioses (>3000 animals), the genetically defined ji interval was narrowed 

20 down to about 800 kb, much larger than was expected based on the number of mice. 
Tbxa2r and D10Bwgl364 were identified as the closest flanking (excluded) markers. 

The physical mapping of this region of mouse Chromosome 10 was completed 
using pulsed field gel electrophoresis, showing conserved gene order and distances 
between mouse Chromosome 10 and human 19p 13.3, but interrupted at the end by an ~ 

25 1 000 kb inversion (Figure 5). 

A recessive form of ataxia prevalent on the Cayman islands has been mapped to 
human Chromosome 19pl3.3 (Nystuen et al, Hum. Mol. Genet. 5:525 [1996]). Only a 
few clinical details have been described in abstracts (Brown et al, Neurology 34:273 
[1984]; Johnson et al, Neurology 28:352 [1978]). 

30 Comparison of the 19pl3 map and the mouse Chromosome 10 map makes it clear 

that the nonrecombinant regions of jittery and Cayman ataxia overlap. While each region 
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individually has been mapped to about 800-1000 kb, comparison of the human and 
mouse genome sequence shows that the region of overlap is about 250 kb. Given the 
variability of the different alleles of jittery and hesitant, there is clear similarity in 
phenotype. 

5 Within the region of overlap, there were about 13 known or predicted genes - the 

precise number is unknown since different prediction programs sometimes disagree 
whether there are 2 genes or a single one and only after testing can these options be 
discriminated. 

These genes were focused on as candidate genes, rather than the >100 in the 
1 0 genetic interval of jittery. Each gene was first screened by ordering an IMAGE cDNA 
clone, often two, one for the 3' region and one for the 5' region, isolated the inserts, and 
hybridized them to Southern blots prepared from DNA of different mutant alleles cleaved 
with a variety of enzymes. Table 1 lists the results of those genes tested. In a few cases 
(e.g. Mek2), Northern blots were prepared, each exon was sequenced, and the protein 
15 levels and sizes between mutants and controls were compared. 



Table 1 


Gene 


Southern 


Northern 


Sequence 


D10Burle=chrl0 7.293 


Negative 






Map2k2=mek2=erkl 


Negative 


Negative 


Negative 


Lrf 








Piasg 


Negative 






Eef2 


Negative 






Dapk3= zip kinase 


Negative 


Negative 


Negative 


chrl0_7.297- integrin related 
(is a different gene than below) 








chrlO 7.297 KIAA1 872 


Positive 


ongoing 


Changes 


chrl0_7.298 (zinc finger cont.) 


Positive 


Negative 


negative 


Math 


Negative 


Negative 


Negative 
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Mrpl54=D10Sutle=Sc32 


Negative 






Apba3 








Tjp3 


Negative 






PipSklc 


Negative 







Table 1 : Candidate gene testing performed in the nonrecombinant jittery/Cayman Ataxia 
region. Negative: no difference between mutant and wild type detected. Empty cells: not 
done. 



5 Example 2 

Identification of the jittery gene. 

.After excluding many different candidate genes, a Southern blot change was 
detected in homozygous hesitant mice with a probe from the 5' end of a Zinc finger- 
containing gene. Comparing the restriction map of the region with the hybridization data 

10 from several enzymes lead to the conclusion that the actual difference is in a gene 5' to 
the Zinc-finger-containing gene. This jittery gene in the human databases is called 
KIAA1872, in mouse it is only part of a predicted gene called chrl0_7.297. ESTs in 
IMAGE clones were available from mouse. Another probe was examined, this time for 
the mouse homologue of KIAA1872, and found with that probe, in addition to hesitant, 

15 there is also a Southern blot change in jittery. The products were then sequenced. It was 
determined that the hesitant allele is an insertion of an IAP element in intron 1 of the 
gene, whereas jittery is a small Bl element insertion in exon 4. Jittery is an old mutation 
and no appropriate control strain is available but the insertion predicts that after 65 (of a 
total of 372) amino acids, the protein will stop, and thus jittery is likely a null allele. In 

20 contrast, an insertion in an intron is likely often spliced across, so hesitant, which has a 
phenotype that is much milder, and is a viable allele, is likely to be a hypomorph with 
reduced expression due to the presence of a large intronic insertion. Because hesitant 
arose on a known strain, C3H/HeJ-TyKc-a> (Jackson Laboratory), the fragments were 
compared by PCR, and it was determined that C3H/HeJ (the strain on which hes is now 

25 maintained) and C3H/HeJ-Tyr<c-a> both do not have the insertion^ Given the fact that 
two different changes were detected in two different alleles, one which arose on a known 
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genetic background that does not carry the mutation, the identity of jittery/hesitant with 
the gene encoded by the mouse homologue of cDNA KIAA1872 was been established. 

The jittery protein is predicted not to be secreted (no leader peptide), and is not 
likely to have a transmembrane region, since there is no extended region of 
5 hydrophobicity long enough to span the membrane (Figure 7). 

Example 3 

Identification of Cayman Ataxia mutations 

Genomic DNA of patients with Cayman ataxia was obtained. All coding exons 
10 and exon-intron junctions were sequenced and compared to control DNA sequences as 
well as to the databank sequence of KIAA1872. A point mutation that results in a non- 
conservative amino acid change in one exon (SEQ ID NO: 8 (nucleic acid) and SEQ ID 
NO:9 (amino acid) and another point mutation at a splice junction (SEQ ED NO: 10) were 
identified. 

15 Given the genetic mapping data and the phenotypic similarities, it was concluded 

that the jittery gene is the Cayman ataxia gene. Because the different alleles of jittery 
have different phenotypes, other ataxia or dystonias in humans may also have mutations 
in the same gene. 

20 Example 4 

Characterization of the "KI AA1 872" protein 

The KIAA1872 (identified above as the Cayman ataxia protein) protein is a 
protein of unknown function. According to the predicted proteins found at various 
genome sites, it isn't even clear what the N-terminal amino acid, i.e. start of translation, 
25 of the protein is. The sequence of macaque, mouse and human of the predicted protein, 
and they are highly conserved. In contrast, the sequences of mouse and human protein 
just before the presumed translation start are not homologous (See below), indicating the 
MGTT is indeed the start of the protein. 

30 PSSDAESAPASILFL 

LGSEGPGSVSDAQLHPGRARLCLPVRRRGCLSCRGVIPASSQCLFPAPMGTTEAT ( SEQ ID 
NO: 5) 
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ASFHQAPRLGTIEKCPPLCPSDSAEAASATEIIFWVTRVSRPLLFPALMGTTEAT (SEQ ID 
N0:6) 



5 The 5' sequence near this methionine has partial homology to the Kozak 

consensus sequence: CCAGCTCT CATGG matches 8/13 to GCCGCCACCAUGG - one 
of very few matches in the region of interest, and the first Methionine after an in-frame 
stop. 

As shown in Figure 6, the predicted protein is about 54% identical and highly 

10 homologous to the NIP2 (also called NIP21) protein of human and mouse. This protein 
has been initially identified as binding to Adenovirus El 9 and to BCL-2, a known 
apoptosis inhibitor. Subsequently, NIP2 has been found to be downregulated by 
estrogen, and in- vitro transfection of NIP2 constructs result in cells undergoing apoptotic 
cell death, whereas co-transfection of NIP2 with BCL-2 constructs reduce the amount of 

15 NIP2-induced apoptotic cell death. In addition, several reports show that NIP2 mRNA is 
downregulated in response to estrogens (e.g., Brusadelli et ai 9 Int J Dev Neurosci. 18:317 
[2000]), which is known to act anti-apoptoticly. These authors postulate that NIP2 is an 
important mediator of the anti-apoptotic property of estrogens in neuronal cells (Maggi, 
2000). Last, it was shown that the apoptotic activity of NIP2 acts via a caspase- 

20 dependent mitochondrial activation (Zhang et al , Circ. Res. 90: 125 1 [2002]) 

While only about a dozen papers have been published on NIP2, they all support 
the idea that NEP2 is an anti-apoptotic protein, binds BCL-2 and is negatively regulated 
by estrogen. The present invention is not limited to a particular mechanism. Indeed, an 
understanding of the mechanism is not necessary to practice the present invention. 

25 Nonetheless, it is contemplated that, given that KIAA1 872/jittery protein is highly 

homologous, that these reports provide information on the function of the human Cayman 
ataxia/KIAA1872 protein. While many ataxias are presumably neurodegenerative, 
reduction in apoptosis has been proposed as a mechanism in one other recessive ataxia in 
humans, namely in Ataxia Telangiectasia (Lee and McKinnon, Apoptosis 5:523 [2000]). 

30 The present invention is not limited to a particular mechanism. Indeed, an 

understanding of the mechanism is not necessary to practice the present invention. 
Nonetheless, it is contemplated that the identification of mutation in the Cayman ataxia 
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gene and polypeptide disclosed herein find use in the diagnosis of ataxia or related 
disorders such as dystonia (characterized by involuntary muscle contractions that force a 
certain part(s) of the body into abnormal, sometimes painful, movements and positions), 
myoclonus (characterized by sudden, involuntary contractions of skeletal muscles) and 
5 nystagmus (characterized by involuntary, rapid movement of the eyeball). 



Example 5 

10 Mutations in a novel gene encoding a CRAL-TRIO domain cause Human Cayman 
Ataxia and ataxia/dystonia in the jittery mouse 

A. Materials and Methods 

15 Human DNA samples: Subjects (affected patients and family members) were recruited 
and gave consent several years ago, with approval by University of Miami IRB. DNA 
was extracted from blood and has previously been used to identify linkage and a region 
of homozygosity as described before (AN, VCS), approved by Iowa IRB (Nystuen et al., 
Hum. Mol. Genet. 5:525 [1996]). DNA was analyzed for mutations blind for affectation 

20 and family status, with approval by Michigan IRB. Complete concordance between 
genotypes and phenotypes was found. 

Genomic control samples were genotyped from 20 Caucasian US men 
(anonymized controls), a panel of 90 US ethnically diverse samples (Coriell, Camden, 
NJ), 84 British, 88 Jamaican, 185 Akan from Ghana and 96 Bamileke from Cameroon - 

25 the latter because the Cayman Islanders are of mixed, primarily British and Jamaican, 
ancestry (Drewett et al., in Prehistoric Settlements in the Caribbean 5-16 (Archetype 
Publications, London, UK, 2000), with Jamaicans originating primarily on the west coast 
of Africa. British genomic DNA samples were sent anonymized by Mike Owen's 
laboratory (Univ. of Wales). Genomic DNA from anonymous West Africans from 

30 Accra, Ghana (N=300) and Caribbeans from Kingston, Jamaica (N=200) were collected 
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for population-based genetic studies on DNA variation in the African Diaspora and were 
used as control samples with IRB approval of Howard University. 



Fine mapping of Cayman Ataxia 

, 5 Previously a 2500 kb haplotype on 19pl3.3 between GATA66B01 (now 

D19S1034) and D19S209, with D19S216 nonrecombinant, was reported in all Cayman 
Ataxia patients (Nystuen et al., supra). Novel polymorphic markers were developed by 
text word searches on sequence data for short tandem repeat polymorphisms (STRPs). 
Amplification of STRP markers was performed with 40 ng of DNA in an 8.3 ^il PCR 

10 reaction mixture containing, 1.25 \i\ PCR buffer (100 mM Tris/HCl pH 8.8, 500 mM 
KC1, 15 mM MgCl2, 0.01% w/v gelatin), 200 \M each dNTP, 2.5 pmol of each primer, 
and 0.25 U Tag polymerase. Samples were subjected to 40 cycles of 94°C for 30 s, 55°C 
for 30 s, and 72°C for 30 s. Amplification products were denatured for 5 min at 95°C and 
loaded onto a 6% denaturing polyacrylamide gel (7.7 M urea). Electrophoresis was 

15 carried out at 60W for 2 hours. The polyacrylamide gels were visualized by silver 
staining. Single nucleotide polymorphisms (SNPs) were identified by non-denaturing 
SSCP gel electrophoresis (6% 49:1 acrylamide:bis-acrylamide containing 5% glycerol) of 
PCR products from various STSs known to map to the interval, ESTs, and gene 
sequencing. 

20 

Mice: 

Originally, all mice were obtained from The Jackson Laboratory (Bar Harbor, 
ME), and subsequently bred. The F2 cross was described previously (Kapfhamer et al., 
Genomics 35:533 [1996]). In addition to ji (jittery) and ji hes (hesitant), one additional 
25 allele was identified in 1999 by the Jackson Laboratory. This allele, sidewinder, jf**, is 
phenotypically very similar to jittery. Allelism was established by complementation 
tests. 

Fine genetic mapping of jittery: 

30 A previously described cross was continued for a total of about 3000 offspring, or 

6000 meioses. Resulting F2 animals were genotyped for the flanking microsatellite 
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markers D10MU140 and D10MU207 as described (Kapfhamer et al., supra). When 
needed, progeny testing was performed with heterozygous recombinants (i.e. if the 
recombinant was between a CAST/Ei homozygous and a heterozygous genotype). Only 
recombinants between flanking markers were further analyzed. Recombinants were 
followed up by analysis with D1QMU21, D10MU23, D10MU226 and D10MU7 and 
D10MU22 using primers described previously (Dietrich et al., Genetics 131:423 [1992]), 
as well as GnalS (see Fig. 12 for map positions). GnalS was genotyped by amplification 
with primers CT222 and CT223 (GAGAACGTGATTGCCCTCATC (SEQ ID NO: 12) 
and GGAGGTGTGAATCTTATCTTC (SEQ ID NO:13)) based on the sequence of the 
mouse cDNA. These primers span an intron that differs in length, resulting in amplified 
fragments 1.1 kb in size in CAST/Ei and 1.4 kb in most laboratory mouse strains. This 
cross identified D10MU21/23 and GnalS as the closest flanking genetic markers. 
D10MU21 and D10MU23 were never genetically separated, and D10MU7 was 
nonrecombinant with ji. To further narrow the nonrecombinant region, Southern blots 
were tested for RFLPs in genes from the region, and RFLPs were used to type critical 
recombinants as described (Kapfhamer et al, supra). The following enzymes that 
showed RFLPs for previously described cDNA probes (Puttagunta etl al., Genome Res. 
10:1369 [2000]) were used to map the genes relative to ji: Ap3d: Hindlll; D10Bwgl364: 
Hindlll; Tbxa2r\ Bglll; Matt AccI; DIOSutle: Bglll and Accl. The results showed that 
D10Bwgl364 and Tbxa2r flank ji as the closest excluded genes. The interval identified is 
about 800 kb, much larger than expected from 6000 informative meioses: With the 
typical ratio of 1600-2000 kb/cM, it was expected that 6000 meioses would get a 
resolution of 25-33 kb, but this genetic interval recombines less often than expected. 
This nonrecombinant region contains over 100 genes. However, limiting the search to 
the region of overlap with Cayman Ataxia left only about 9 known or predicted genes, 
which were tested by Southern blots for mutations in jittery alleles. 

Southern Blot: 

Genomic DNA of all three jittery alleles was cleaved with 6 different restriction 
enzymes, Accl, Mspl, Taq I, Hindlll, PstI and PvuII, separated by 0.8% agarose gel 
electrophoresis, alkaline blotted to nylon membranes (Hybond N+, Amersham, 
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Piscataway, NJ) and hybridized to cDNA probes for the 5' and 3' ends of each of the 14 
known or predicted genes as previously described (Reese et al, J. Comput. Biol. 4:31 1 
[1997]. The cDNA probes were generated from IMAGE clones ordered from Research 
Genetics. Inserts were amplified by PCR with flanking primers, typically corresponding 
5 to the SP6, T7 or T3 sequences of the IMAGE clones. 

Northern blot: 

10-3(Higpoly (A) RNA was fractionated on a 1% agarose gel containing lx Gel 
Prep/Gel Running Buffer and transferred to HybondTM-N + (Amersham, Piscataway, NJ) 

1 0 using the NorthernMax-Gly system (Ambion, Austin, TX, USA) according to 

manufacturer's protocol, and using a commercial RNA size standard (.363-9.488 Kb, 
USB, Cleveland, OH). Prehybridization and hybridization were carried out for 30 min 
and 1 hr respectively using Express Hyb (Clontech, Palo Alto, CA) hybridization solution 
at 68°C. Blots were washed four times with 2 X SSC containing 0.05% SDS at room 

15 temperature for 10 min each, followed by two washes with 0.1 X SSC containing 6.1% 
SDS at 50° for 20 min each. cDNA probes were prepared from RT-PCR products or 
IMAGE clones by P 32 radioactively labeling using Random Primed DNA Labeling Kit 
(Roche Applied Science, Indianapolis, IN). Blots were exposed to BioMax MS film 
(Eastman Kodak Company, Rochester, NY, USA). 

20 

Genomic PCR 

Genomic PCR was performed on 50-100 ng template DNA (tail or spleen 
preparation from mouse, whole blood or lymphocyte DNA from human) using a PTC 100 
thermal cycler (MJ Research, Watertown, MA, USA). Primers were designed using the 

25 Web-based Primer 3 program (Burge et al., J. Mol.Biol. 268:78 [1997]), using the 

sequence surrounding KIAA1872 (NMJB3064) from various publicly available Human 
Genome Internet servers. Products smaller than 1 Kb were amplified in a 20jal reaction 
mix containing 4mM dNTPs, 3|iM primers, 2^1 of lOx Optiprime Buffer (Stratagene, La 
Jolla, CA, (see Table 2 (Figure 15) for sequences of all primer sequences), and 2 U of 

30 Taq polymerase. Conditions used for PCR were 95° C for 1 min, 40 cycles of 95° C for 
30 sec, 56°-61° C for 1 min, 72° C for 1 min, with final extension at 72° C for 10 min. 
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Products larger than 1 kb were amplified in a lO^il reaction mix containing lOmM 
dNTPs, 3\iM primers, 1 jlxI Expand Long Template 10X Buffer (Hoffman LaRoche, Basel, 
Switzerland), and 0.75 U Expand Long Template Enzyme mix. PCR conditions were 94° 
C for 2 min, 9 cycles of 94°C for 10 sec, 56°-60° for 30 sec, 68°C for 2 min 30 sec, 20 
5 cycles of 94°C for 15 sec, 56°-60° C for 30 sec, 68°C for a start of 2 minutes 50 sec 
incremented by 20 sec per cycle, and 7 min at 68° C. PCR products were separated by 
agarose gel electrophoresis and purified using QIAquick PCR Purification or QIAquick 
Gel Extraction Kits (Qiagen, Valencia, CA, USA). Sequencing was performed by the 
University of Michigan's sequencing core, and was analyzed using Lasergene's Seqman 
1 0 analysis program. 

Human Mutation screen: 

The C to G change at the 37 th base of ATCAY (KIAA1872) exon nine and a G to T 
change three bases after exon 9 are predicted to remove an Alu I and a TspRI site, 

1 5 respectively. Amplification with primers "exon 9 left" and "exon 9 right" (see Table 2) 
results in a PCR product 272 bp. In control, but not mutant samples this product is 
cleaved by Alu I into fragments of 174 and 98 bp, PCR amplification with primers "exon 
9 left" and "exon 9 right (a)" Upon TspRI digestion, mutant samples result in fragments 
of 634, 559, 75, and 21 base pairs, whereas control samples result in fragments of 415, 

20 340, 219, 75, and 21 base pairs. This complex pattern is due to partial cleavage at one of 
the TspRI restriction sites not related to Cayman Ataxia. 

RT-PCR: 

Total RNA was isolated from brain using TRIzol Reagent (Invitrogen Life 
25 Technologies, Carlsbad, CA, USA) according to manufacturer's instructions. Poly (A)+ 
RNA was extracted from total RNA using PolyATtract (Promega, Madison, WI, USA). 
cDNA was transcribed from 50-500 ng poly (A)+ RNA using Superscript II RNase H- 
Reverse Transcriptase according to manufacturer's protocol (Invitrogen life technologies, 
Carlsbad, CA, USA). ji hes /ji hes cDNA was transcribed using Superscript III RNAse H- 
30 Reverse Transcriptase with a gene specific primer (exon 3-8 right) and was incubated at 
55°C (Invitrogen life technologies, Carlsbad, CA, USA). 50-100 ng of cDNA were used 
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as a template under the PGR conditions listed above. RT-PCR products were separated on 
a 1% agarose gel, visualized with by Ethidium Bromide staining, purified using 
QIAquick PCR Purification or QIAquick Gel Extraction Kits (Qiagen, Valencia, CA), 
and sequenced at the University of Michigan Sequencing Core Facility. The partial 
5 mouse cDNA sequence of Atcay has genbank accession # AY3491 50, the jittery and 
hesitant mutations are #AY349151 (Bl insertion), AY349152 and AY349153 (two ends 
of the LAP element insertion). 

Splice prediction: 

1 0 To evaluate whether the mutation in intron 9 of Cayman ataxia patients is likely to 

interfere with splicing, the sequence was analyzed. First, inspection of the splice site 
shows that the junction sequence is atypical: While (C/A)AG|gt(a/g)agt is the typical 
consensus of a splice donor, the exon 9/intron 9 boundary of ATCA Y has the sequence: 
TGCA|gtgagt, whereas the mutant junction is TGCAgttagt. Nevertheless, this is the 

1 5 correct splice site based on numerous EST and RT-PCR sequences. To determine the 

consequence of the mutation, a splice donor prediction program based on neural networks 
was used (Herman et al, Endocrinology 127:2408 [1990], available for human genome 
on the world wide web site offhiitfly.org. This program identified many potential donor 
sites in and around exon 9 and intron 9, including the correct splice donor site mentioned 

20 above. The score at the exon 9/intron 9 junction was 0.78, corresponding to about 78% 
chance this is a real splice donor. However, when the change found in Cayman Ataxia 
patients is tested, the program no longer predicts splicing at this site - in fact, the 
calculated score is 0.03. GenScan (Lin et al., Hum. Mol. Genet. 10:137 [2001]) is an 
exon prediction program that relies on consensus sequence, not coding frame, correctly 

25 predicts the 102 bp exon. However, when the Cayman mutant sequence is tested, a 133 
bp exon is predicted which reads significantly into intron 9, with a predicted stop after 10 
novel amino acids. Thus, two methods predict that the splice mutation severely interferes 
with splicing. 

30 in-situ hybridization: 
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Fresh frozen brain tissues were sectioned at 15 microns on a Leica CM 1900. 
Embryo sections, prepared from timed pregnancies and sectioned similarly, were 
purchased from the University of Michigan Morphology Core. Slides were fixed in 4% 
paraformaldehyde. Sections were washed for 1 hr in 2x SSC, acetylated for 10 min at 
5 room temperature with 0.25% acetic anhydride in 0.1 M triethanolamine (pH 8.0), and 
dehydrated through graded ethanol. Sections were then incubated overnight at 55°C with 
~1 x 10 6 dpm of probe in 30 jil of hybridization buffer (50% formamide, 3X SSC, lx 
Denhardt's solution, 200 ng/ml tRNA, 50 mM sodium phosphate buffer pH 7.4, 10% 
dextran sulfate (70 kD average MW), and 10 mM DTT). The next day, slides were 

10 washed three times in 2x SSC, treated with RNase (200 ng/ml in 10 mM Tris/HCl, pH 
8.0, 500 mM NaCl) for 1 hr at 37°C, washed in 0.1X SSC at 70°C for 1 hr, and 
dehydrated through graded ethanol before exposure to BioMax MS film (Eastman Kodak 
Company, Rochester, NY, USA). 

Emulsion Autoradiography and counterstaining was conducted to better interpret 

15 the exact location. Slides were dipped in Kodak NTB-2nuclear emulsion and exposed for 
one week. Slides were developed in D-19 (Eastman Kodak) for 3 minutes, and slightly 
counterstained with cresyl violet, dehydrated and coverslipped in Permount. 

The probe for in-situ was prepared from clone UI-M-BHO-ajo-f-05-0-UI.sl, 
which contains exons 5-13 purchased from Research Genetics and sequence verified. 

20 Plasmid DNA was digested with Hind III for sense and Bgl I for antisense probe. After 
phenol extraction and ethanol precipitation, T3 (antisense) or T7 (sense) RNA 
polymerase reactions were performed on 1 \ig linearized plasmid DNA template for 90 
min. at 37°C in transcription buffer,' with 0.01 M DTT, S 35 UTP, 0.5 mM each of ATP, 
GTP, CTP, and Rnasin. The reaction was stopped by the addition of RNAse free DNAse, 

25 and products purified by spin columns (Micro Bio-Spin 30, Biorad, Hercules, CA). After 
addition of DTT to 0.1 M, cpm were determined. 

Histopathology: 

Two jittery, two sidewinder, and five control littermate mice, aged 3 weeks +/- 3 
30 days were processed for histology. Mice were euthanized by decapitation, the brains 
extracted from the cranial vault and immersion fixed in 4% paraformaldehyde - 0.1M 
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phosphate buffer (pH 7.4). Brains were fixed for 24 to 48 hours and embedded in 
paraffin. Six-micron thick sections were cut on a rotary microtome in the horizontal 
plane, mounted on glass slides, and deparafinized and rehydrated. Sections were stained 
with 0.5 % cresyl violet for routine histology. TUNEL labeling was assessed with an 
5 ApopTag Plus kit (Intergen Co., Purchase, NY) according to the manufacturer's 
directions. Reactive gliosis was assessed with glial fibrillary acidic protein (GFAP) 
immunohistochemistry with a commercially available polyclonal antibody (1:250, Dako, 
Carpinteria, CA) and the ABC method using a commercially available kit (Vector, 
Carpinteria, CA). 

10 

B. Results 

The Cayman ataxia locus, ATCAY, was previously mapped to a >2.0 Mb region 
on 19pl3.3 (Nystuen et al., supra). Comparative mapping suggested that the mouse 

15 mutation jittery, which maps to the homologous region on mouse Chromosome 10 

(Kapfhamer et al., supra), is the murine homologue of ATCAY. While hesitant animals 
show mild ataxia and dystonia but normal fertility and life expectancy, jittery mice have 
severe truncal and limb ataxia and die of dehydration and starvation by 3-4 weeks of age, 
but both are allelic (Kapfhamer et al., supra). Continuation of a previously described 

20 cross narrowed the critical interval to 800 kb (Figure 12). Similarly, the Cayman Ataxia 
locus was narrowed further through additional obligate recombinants (Figure 14). The 
critical intervals of Cayman Ataxia and jittery overlapped in a region spanning 150 kb in 
mouse and contained nine known or predicted genes (Fig. 5). 

Because many spontaneous mouse mutations are caused by deletions or insertions 

25 (Kazazian et al., Nat. Genet. 19:19 [1998]), all genes from the critical region were 
screened by Southern blot hybridization. One probe for an unknown predicted gene 
showed DNA changes to controls with both jittery and hesitant alleles. In-silico 
restriction mapping predicted the locations of the mutations as intron 1 (hesitant) and 
exon 4 (jittery), respectively. Genomic PCRs showed larger products in each case. 

30 Sequence analysis of the aberrant PCR products showed an IAP element insertion in 

intron 1 causes hesitant, and a Bl element insertion in exon 4 in jittery. The mutation in 
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a third allele, sidewinder, ji sw , which is similar in phenotype to jittery and was identified 
at the Jackson Laboratory, was identified by exon sequencing as a two bp deletion in 
exon 5. The insertion in exon 4 of jittery is the first mouse mutation caused by Alu- 
related mutagenesis, although this mechanism is common in humans (Kazazian et al., 
5 supra). 

The IAP insertion in hesitant results in aberrant, much larger transcripts and small 
amounts of normal transcript, explaining why hesitant is only mildly affected. Jittery 
shows reduced amounts of mRNA, with the predominant species about 190 bp larger due 
to Bl element insertion. The Bl element insertion predicts a truncated, not likely 

10 functional protein of only 62 (+ 21 missense) instead of the normal 372 amino acids. An 
additional rare smaller ji transcript is visible on Northern blots and by RT-PCR. 
Sequencing shows that the 222 bp exon 4 is skipped entirely in this minor product, which 
thus restores the open reading frame, with the predicted protein missing 74 amino acids. 
This process is likely due to nonsense-mediated altered splicing, as has been found in 

15 other mutations (Lui et al., Nat. Genet. 27:55 [2001]). The sidewinder jf wd allele is 
predicted to result in a shortened protein of 1 8 1 (+1 9 missense) amino acids. 

In DNA from Cayman Ataxia patients, sequencing of exons and exon-intron 
boundaries identified two homozygous sequence variants: a G to C change in exon 9, 
predicting a serine to arginine substitution at amino acid 301, and a T to G substitution in 

20 the third base of intron 9. Both mutations completely segregated with the disorder/carrier 
status in over 40 family members that were genotyped blindly. None of over 1000 
chromosomes from Caucasian, British, Jamaican, or African control samples showed 
either of the two mutations. These mutations are thus in complete linkage disequilibrium 
with each other. The serine that is changed to arginine is conserved between human, 

25 mouse and rat, but not across the different paralogues of the protein, and not predicted to 
interfere with the structure of the protein. In contrast, analysis of the splice mutation 
predicts aberrant splicing resulting in a truncated protein that would miss most of the 
most conserved domain of the protein, and is likely the disease-causing mutation. The 
present invention is not limited to a particular mechanism. Indeed, and understanding of 

30 the mechanism is not necessary to practice the present invention. Nonetheless, it is 

contemplated that, like in hesitant, some normal splicing may remain, and both mutations 
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may contribute ,0 the phenotype. These results stroug.y argue that the Caymau Ataxra 
^ATCAYIMcay, and the corresponding protein CAYTAXIN, have been identified. 

Caytaxin is expressed exclusively in neuronal tissues. Northern blot analysts 
showed strong expression restricted to brain with nee main transcripts owing to dtfferen. 
i polyadenylation sites. In sin. hybridization to adult mouse brain showed strong 

expression in virtually all parts of me brain, including cortex, cerebellum and olfactory 
bulbs In the whole embryo, expression was also completely restricted to neuronal 
.issues, including brain, dorsal root ganglia and enteric nervous system. Externa, bratn 
morphology and brain weights of mutants were normal, and routine . histology showed 
0 normal regional and cellular architecture. The present invention is not tanned to a 
particular mechanism. Indeed, an understanding of the mechanism is no, necessary to 
understand the present invention. Nonetheless, it is contemplated that, based on the 
above findings, that caytaxin does not effect normal brain development. 

CAYTAXIN has sequence similarity to a protein called BN1P2 (See below and 
,5 Figure 13), which has been implicated in apop.osis (Boyd ct a.., Cell 79:341 [1994]).. 
There was no evidence of reactive astrocytoma with GFAP immunohistochemrstry or 
increased TUNEL staining. The present invention is not limited to a parttcular 
mechanism. Indeed, art understanding of the mechanism is no. necessary to understand 
the present invention. Nonetheless, it is contemplated drat increased apoptosis is unhkely 
20 to be responsible for the phenotype. 

Sequence analysis of caytaxin showed similarities to two other proteins, BNIP2 
and KIAA0367, as well as homology in the C-tetminal end to a Secl4 domatn w«h a 
CRAL-TRIO domain (Figure 13). This domain, named after cellular retinal and the 
TRIO guanine exchange factor, binds small lipophilic molecules, such as retinal, vttanun 
2S E and squalene, a cholesterol precursor (Stocker e. a.., Stnrcture (Canrb) 10:1533 [2002] 
as we,, as inosito, and related molecules (Sha e, a.., Nature 39, :506 [1998]). Mutatron of 
another protein containing a CRAL-TRIO domain, TTPA, which transports vttamm E, 
causes the rate Friedreieh-like ataxia with selective vitamin E deficiency, which responds 
to targe dosesof vitamin E (Ouhchi e, a.., Nat Gene,. 9:141 [1995]). Comparison of a 
30 tirree-dimensiona, mode, of caytaxin witir ,ha, of TTPA suggests flra, tire ligand-btndtng 
pocke, of caytaxin is more polar titan ma, of TTPA. In addition, feeding vitamtn E (17 
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times the normal amount in food) to hesitant mice for one month did not improve the 
phenotype. The present invention is not limited to a particular mechanism. Indeed, an 
understanding of the mechanism is not necessary to practice the present invention. 
Nonetheless, it is contemplated that a ligand different from vitamin E binds to caytaxin. 

5 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method 
and system of the invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the invention. Although the invention has been described in 
10 connection with specific preferred embodiments, it should be understood that the 

invention as claimed should not be unduly limited to such specific embodiments. Indeed, 
various modifications of the described modes for carrying out the invention that are 
obvious to those skilled in molecular biology, genetics, or related fields are intended to be 
within the scope of the following claims. 

15 
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